Back to Main Conference 2010
LREC 2010main

A Morphological Processor Based on Foma for Biscayan (a Basque dialect)

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/3wd9okgdmy2q

Abstract

We present a new morphological processor for Biscayan, a dialect of Basque, developed on the description of the morphology of standard Basque. The database for the standard morphology has been extended for dialects and an open-source tool for morphological description named foma is used for building the processor. Biscayan is a dialect of the Basque language spoken mainly in Biscay, a province on the western of the Basque Country. The description of the lexicon and the morphotactics (or word grammar) for the standard Basque was carried out using a relational database and the database has been extended in order to include dialectal variants linked to the standard entries. XuxenB, a spelling checker/corrector for this dialect, is the first application of this work. Additionally to the basic analyzer used for spelling, a new transducer is included. It is an enhanced analyzer for linking standard form with the corresponding standard ones. It is used in correction for generation of proposals when in the input text appear standard forms which we want to replace with dialectal forms.

Details

Paper ID
lrec2010-main-096
Pages
N/A
BibKey
alegria-etal-2010-morphological
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • IA

    Iñaki Alegria

  • GA

    Garbiñe Aranbarri

  • KC

    Klara Ceberio

  • GL

    Gorka Labaka

  • BL

    Bittor Laskurain

  • RU

    Ruben Urizar

Links