Co-reference annotation and resources: A multilingual corpus of typologically diverse languages
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)
Abstract
This article introduces a dialogue corpus containing data from two typologically different languages, Japanese and Kilivila. The corpus is annotated according to language-specific annotation schemes for co-referential and similar relations. The article describes the corpus data, the language-specific properties of co-reference in the two languages, and a methodology for annotating co-reference. Examples from the corpus show how this methodology is applied in the annotation workflow.