DialEdit: Annotations for Spoken Conversational Image Editing (bibtex)
by Ramesh Manuvinakurike, Jacqueline Brixey, Trung Bui, Walter Chang, Ron Artstein, Kallirroi Georgila
Abstract:
We present a spoken dialogue corpus and annotation scheme for conversational image editing, where people edit an image interactively through spoken language instructions. Our corpus contains spoken conversations between two human participants: users requesting changes to images and experts performing these modiļ¬cations in real time. Our annotation scheme consists of 26 dialogue act labels covering instructions, requests, and feedback, together with actions and entities for the content of the edit requests. The corpus supports research and development in areas such as incremental intent recognition, visual reference resolution, image-grounded dialogue modeling, dialogue state tracking, and user modeling.
Reference:
DialEdit: Annotations for Spoken Conversational Image Editing (Ramesh Manuvinakurike, Jacqueline Brixey, Trung Bui, Walter Chang, Ron Artstein, Kallirroi Georgila), In Proceedings of the 14th Joint ACL - ISO Workshop on Interoperable Semantic Annotation, Association for Computational Linguistics, 2018.
Bibtex Entry:
@inproceedings{manuvinakurike_dialedit:_2018,
	address = {Santa Fe, New Mexico},
	title = {{DialEdit}: {Annotations} for {Spoken} {Conversational} {Image} {Editing}},
	url = {https://aclanthology.info/papers/W18-4701/w18-4701},
	abstract = {We present a spoken dialogue corpus and annotation scheme for conversational image editing, where people edit an image interactively through spoken language instructions. Our corpus contains spoken conversations between two human participants: users requesting changes to images and experts performing these modiļ¬cations in real time. Our annotation scheme consists of 26 dialogue act labels covering instructions, requests, and feedback, together with actions and entities for the content of the edit requests. The corpus supports research and development in areas such as incremental intent recognition, visual reference resolution, image-grounded dialogue modeling, dialogue state tracking, and user modeling.},
	booktitle = {Proceedings of the 14th {Joint} {ACL} - {ISO} {Workshop} on {Interoperable} {Semantic} {Annotation}},
	publisher = {Association for Computational Linguistics},
	author = {Manuvinakurike, Ramesh and Brixey, Jacqueline and Bui, Trung and Chang, Walter and Artstein, Ron and Georgila, Kallirroi},
	month = aug,
	year = {2018},
	keywords = {UARC, Virtual Humans}
}
Powered by bibtexbrowser