Construction of bilingual multimodal corpora of referring expressions in collaborative problem solving

Tokunaga, Takenobu, Iida, Ryu, Yasuhara, Masaaki, Terai, Asuka, Morris, David and BELZ, ANJA (2010) Construction of bilingual multimodal corpora of referring expressions in collaborative problem solving In: 8th Workshop on Asian Language Resources, 21-22 August 2010, Coling.

[img]
Preview
PDF - Published Version
Available under License Creative Commons Attribution.

827Kb

Official URL: http://clair.eecs.umich.edu/aan/paper.php?paper_id...

Abstract

This paper presents on-going work on constructing bilingual multimodal corpora of referring expressions in collaborative problem solving for English and Japanese. The corpora were collected from dialogues in which two participants collaboratively solved Tangram puzzles with a puzzle simulator. Extra-linguistic information such as operations on puzzle pieces, mouse cursor position and piece positions were recorded in synchronisation with utterances. The speech data was transcribed and time-aligned with the extra-linguistic information. Referring expressions in utterances that refer to puzzle pieces were annotated in terms of their spans, their referents and their other attributes. The Japanese corpus has already been completed, but the English counter-part is still undergoing annotation. We have conducted a preliminary comparative analysis of both corpora, mainly with respect to task completion time, task success rates and attributes of referring expressions. These corpora showed significant differences in task completion time and success rate.

Item Type:Contribution to conference proceedings in the public domain ( Full Paper)
Subjects:G000 Computing and Mathematical Sciences > G400 Computing
Q000 Languages and Literature - Linguistics and related subjects > Q100 Linguistics
Faculties:Faculty of Science and Engineering > School of Computing, Engineering and Mathematics > ICT and Cultural Heritage
Faculty of Science and Engineering > School of Computing, Engineering and Mathematics > Natural Language Technology
ID Code:7706
Deposited By:Converis
Deposited On:30 Sep 2010 12:57
Last Modified:07 Feb 2013 03:05

Repository Staff Only: item control page