Back to Main Conference 2008
LREC 2008main

Linguistic Resources and Evaluation Techniques for Evaluation of Cross-Document Automatic Content Extraction

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/23nqo4rf4ozi

Abstract

The NIST Automatic Content Extraction (ACE) Evaluation expands its focus in 2008 to encompass the challenge of cross-document and cross-language global integration and reconciliation of information. While past ACE evaluations have been limited to local (within-document) detection and disambiguation of entities, relations and events, the current evaluation adds global (cross-document and cross-language) entity disambiguation tasks for Arabic and English. This paper presents the 2008 ACE XDoc evaluation task and associated infrastructure. We describe the linguistic resources created by LDC to support the evaluation, focusing on new approaches required for data selection, data processing, annotation task definitions and annotation software, and we conclude with a discussion of the metrics developed by NIST to support the evaluation.

Details

Paper ID
lrec2008-main-390
Pages
N/A
BibKey
strassel-etal-2008-linguistic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 30 May 2008

Authors

  • SS

    Stephanie Strassel

  • MP

    Mark Przybocki

  • KP

    Kay Peterson

  • ZS

    Zhiyi Song

  • KM

    Kazuaki Maeda

Links