Title

Parallel corpora for the Galician language: building and processing of the CLUVI (Linguistic Corpus of the University of Vigo)

Author(s)

Xavier Gómez-Guinovart, Elena Sacau Fontenla

SLI (Computational Linguistics Group of the University of Vigo)

Session

 

Abstract

In this paper, we present the methodology developed by the SLI (Computational Linguistics Group of the University of Vigo) for the building and processing of the CLUVI Corpus, showing the TMX-based XML specification designed to encode both morphosyntactic features and translation alignments in parallel corpora, and the solutions adopted for making the CLUVI parallel corpora freely available over the WWW (http://sli.uvigo.es/CLUVI/).

Keyword(s)

TMX, XML, parallel corpora, translation, Galician

Language(s)

Galician

Full Paper

290.pdf