Developing New Linguistic Resources and Tools for the Galician Language

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/35btoyxg42fj

Abstract

In this paper we describe the work towards developing new resources and Natural Language Processing (NLP) tools for the Galician language. First, a new corpus, manually revised, for POS tagging and lemmatization is described. Second, we present a new manually annotated corpus for Named Entity tagging for Galician. Third, we train and develop new NLP tools for Galician, including the first publicly available Galician statistical modules for lemmatization and Named Entity Recognition, and new modules for POS tagging, Wikification and Named Entity Disambiguation. Finally, we also present two new Web demo applications to easily test the new set of tools online.

Details

Paper ID
lrec2018-main-367
Pages
N/A
BibKey
agerri-etal-2018-developing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 to 12 May 2018

Authors

  • Rodrigo Agerri

  • Xavier Gómez Guinovart

  • German Rigau

  • Miguel Anxo Solla Portela
