Back to Main Conference 2004
LREC 2004main

Creation of Reusable Components and Language Resources for Named Entity Recognition in Russian

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/46yfx2ijf4z8

Abstract

This paper describes the development of the RussIE system in which we experimented with the creation of reusable processing components and language resources for a Russian Information Extraction system. The work was done as part of a multilingual project to adapt existing tools and resources for HLT to new domains and languages. The system was developed within the GATE architecture for language processing, and aims to explore the boundaries of language resource reuse and adaptability across languages and language types, rather than to create a full-scale IE system at the very peak of performance. Nevertheless, the systgem achieves a very creditable 71% F-Measure on news texts, and there is much scope for future improvement of this score.

Details

Paper ID
lrec2004-main-138
Pages
N/A
BibKey
popov-etal-2004-creation
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • BP

    Borislav Popov

  • AK

    Angel Kirilov

  • DM

    Diana Maynard

  • DM

    Dimitar Manov

Links