Back to Main Conference 2002
LREC 2002main

Current Developments of STO - the Danish Lexicon Project for NLP and HLT Applications

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/2yoi2psa6isz

Abstract

The Centre for Language Technology (Center for Sprogteknologi, CST) is in charge of a national project developing a large-scale Danish lexicon for HLT and NLP applications. The short name of the project is STO, which stands for SprogTegnologisk Ordbase  (Lexical Database for Language Technology). The project is inspired by principles and methods applied in the multilingual LE-PAROLE project (1996-98) the aim of which was to develop harmonised written language resources for 12 EU languages. The  Danish PAROLE lexicon was produced by CST and the STO project highly benefits from the experience acquired from the work  mentioned. This paper deals with a few central tasks of the ongoing project. It discusses the development of a smaller lexical  resource produced in a multilingual environment into a large-scale, monolingual resource. Two different methods of increasing the vocabulary will be presented in detail; the extension of the linguistic coverage and the refinement of the linguistic description by including more detailed language-specific information. Finally, some exploitation perspectives and the development of an internet-based user-interface will be presented. The STO project gets funding from the Danish Ministry for Science, Technology and Development for a period of three years (2001-2004).

Details

Paper ID
lrec2002-main-261
Pages
N/A
BibKey
braasch-2002-current
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • AB

    Anna Braasch

Links