LREC 2016 (main conference)

CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/4x6av2s74pti

Abstract

This paper presents a new linguistic resource for the study and computational processing of Portuguese. CINTIL DependencyBank PREMIUM is a corpus of Portuguese news text, accurately and manually annotated with a wide range of linguistic information (morpho-syntax, named entities, syntactic functions and semantic roles), making it an invaluable resource, especially for the development and evaluation of data-driven natural language processing tools. The corpus is under active development and reaches 4,000 sentences in its current version. The paper also reports on the training and evaluation of a dependency parser over this corpus. CINTIL DependencyBank PREMIUM is freely available for research purposes through META-SHARE.

Details

Paper ID
lrec2016-main-246
Pages
pp. 1552-1557
BibKey
de-carvalho-etal-2016-cintil
Editors
Nicoletta Calzolari, Khalid Choukri, Thierry Declerck, Sara Goggi, Marko Grobelnik, Bente Maegaard, Joseph Mariani, Hélène Mazo, Asunción Moreno, Jan Odijk, Stelios Piperidis
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23-28 May 2016

Authors

  • Rita de Carvalho
  • Andreia Querido
  • Marisa Campos
  • Rita Valadas Pereira
  • João Silva
  • António Branco
