Back to Main Conference 2002
LREC 2002main

Diversity of Scenarios in Information extraction

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/2zdcy73vby7t

Abstract

This paper discusses/presents problems of template structure for Information Extraction. We investigate these problems in the context of two new Information Extraction scenarios which are linguistically and structurally more challenging than the traditional MUC scenarios. By a scenario we mean a predefined set of facts to be extracted from text. Traditional views on event structure and template design are not adequate for the more complex scenarios. We identify two structural factors that contribute to the complexity of a scenario: first, the scattering of events in text, and second, inclusion relationship between events. These factors cause difficulty in representing the facts in an unambiguous way. Traditional views on event structure and template design are not adequate for the more complex scenarios. We propose that these kinds of event relationships can be better described with a modular, hierarchical model.

Details

Paper ID
lrec2002-main-335
Pages
N/A
BibKey
huttunen-etal-2002-diversity
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • SH

    Silja Huttunen

  • RY

    Roman Yangarber

  • RG

    Ralph Grishman

Links