Back to Main Conference 2004
LREC 2004main
A Registry of Standard Data Categories for Linguistic Annotation
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)
Abstract
In this paper we describe the most recent work within ISO TC37/SC 4, and in particular the development of a Data Category Registry (DCR) component of the Linguistic Annotation Framework. The DCR will contain a formally defined set of linguistic categories in common use within the language engineering community for reference and use in linguistically annotated resources. We outline the first proposals for creation and management of the DCR, as a solicitation for input from the community.