Back to Main Conference 2008
LREC 2008main

SCARE: a Situated Corpus with Annotated Referring Expressions

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/28skvb4nn9cf

Abstract

Even though a wealth of speech data is available for the dialog systems research community, the particular field of situated language has yet to find an appropriate free resource. The corpus required to answer research questions related to situated language should connect world information to the human language. In this paper we report on the release of a corpus of English spontaneous instruction giving situated dialogs. The corpus was collected using the Quake environment, a first-person virtual reality game, and consists of pairs of participants completing a direction giver- direction follower scenario. The corpus contains the collected audio and video, as well as word-aligned transcriptions and the positional/gaze information of the player. Referring expressions in the corpus are annotated with the IDs of their virtual world referents.

Details

Paper ID
lrec2008-main-033
Pages
N/A
BibKey
stoia-etal-2008-scare
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 30 May 2008

Authors

  • LS

    Laura Stoia

  • DS

    Darla Magdalene Shockley

  • DB

    Donna K. Byron

  • EF

    Eric Fosler-Lussier

Links