Back to Main Conference 2010
LREC 2010main

LIMA : A Multilingual Framework for Linguistic Analysis and Linguistic Resources Development and Evaluation

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/3msmkgx3fqda

Abstract

The increasing amount of available textual information makes necessary the use of Natural Language Processing (NLP) tools. These tools have to be used on large collections of documents in different languages. But NLP is a complex task that relies on many processes and resources. As a consequence, NLP tools must be both configurable and efficient: specific software architectures must be designed for this purpose. We present in this paper the LIMA multilingual analysis platform, developed at CEA LIST. This configurable platform has been designed to develop NLP based industrial applications while keeping enough flexibility to integrate various processes and resources. This design makes LIMA a linguistic analyzer that can handle languages as different as French, English, German, Arabic or Chinese. Beyond its architecture principles and its capabilities as a linguistic analyzer, LIMA also offers a set of tools dedicated to the test and the evaluation of linguistic modules and to the production and the management of new linguistic resources.

Details

Paper ID
lrec2010-main-370
Pages
N/A
BibKey
besancon-etal-2010-lima
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • RB

    Romaric Besançon

  • Gd

    Gaël de Chalendar

  • OF

    Olivier Ferret

  • FG

    Faiza Gara

  • OM

    Olivier Mesnard

  • ML

    Meriama Laïb

  • NS

    Nasredine Semmar

Links