Back to Main Conference 2000
LREC 2000main

A Word-level Morphosyntactic Analyzer for Basque

Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)

DOI:10.63317/5a6yorfnqoi4

Abstract

This work presents the development and implementation of a full morphological analyzer for Basque, an agglutinative language. Several problems (phrase structure inside word-forms, noun ellipsis, multiplicity of values for the same feature and the use of complex linguistic representations) have forced us to go beyond the morphological segmentation of words, and to include an extra module that performs a full morphosyntactic parsing of each word-form. A unification-based word-level grammar has been defined for that purpose. The system has been integrated into a general environment for the automatic processing of corpora, using TEI-conformant SGML feature structures.

Details

Paper ID
lrec2000-main-033
Pages
N/A
BibKey
aduriz-etal-2000-word-level
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Second International Conference on Language Resources and Evaluation
Location
Athens, Greece
Date
31 May 2000 2 June 2000

Authors

  • IA

    I. Aduriz

  • EA

    E. Agirre

  • IA

    I. Aldezabal

  • XA

    X. Arregi

  • JA

    J. M. Arriola

  • XA

    X. Artola

  • KG

    K. Gojenola

  • AM

    A. Maritxalar

  • KS

    K. Sarasola

  • MU

    M. Urkia

Links