Back to Main Conference 2006
LREC 2006main

Structure, Annotation and Tools in the Basque ZT Corpus

Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)

DOI:10.63317/2kv8rpmmt9vn

Abstract

The ZT corpus (Basque Corpus of Science and Technology) is a tagged collection of specialized texts in Basque, which wants to be a main resource in research and development about written technical Basque: terminology, syntax and style. It will be the first written corpus in Basque which will be distributed by ELDA (at the end of 2006) and it wants to be a methodological and functional reference for new projects in the future (i.e. a national corpus for Basque). We also present the technology and the tools to build this Corpus. These tools, Corpusgile and Eulia, provide a flexible and extensible infrastructure for creating, visualizing and managing corpora and for consulting, visualizing and modifying annotations generated by linguistic tools.

Details

Paper ID
lrec2006-main-168
Pages
N/A
BibKey
areta-etal-2006-structure
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-2-4
Conference
Fifth International Conference on Language Resources and Evaluation
Location
Genoa, Italy
Date
24 May 2006 26 May 2006

Authors

  • NA

    N. Areta

  • AG

    A. Gurrutxaga

  • IL

    I. Leturia

  • ZP

    Z. Polin

  • RS

    R. Saiz

  • IA

    I. Alegria

  • XA

    X. Artola

  • AD

    A. Diaz de Ilarraza

  • NE

    N. Ezeiza

  • AS

    A. Sologaistoa

  • AS

    A. Soroa

  • AV

    A. Valverde

Links