Back to Main Conference 2016
LREC 2016main

CINTIL DependencyBank PREMIUM - A Corpus of Grammatical Dependencies for Portuguese

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/4x6av2s74pti

Abstract

This paper presents a new linguistic resource for the study and computational processing of Portuguese. CINTIL DependencyBank PREMIUM is a corpus of Portuguese news text, accurately manually annotated with a wide range of linguistic information (morpho-syntax, named-entities, syntactic function and semantic roles), making it an invaluable resource specially for the development and evaluation of data-driven natural language processing tools. The corpus is under active development, reaching 4,000 sentences in its current version. The paper also reports on the training and evaluation of a dependency parser over this corpus. CINTIL DependencyBank PREMIUM is freely-available for research purposes through META-SHARE.

Details

Paper ID
lrec2016-main-246
Pages
pp. 1552-1557
BibKey
de-carvalho-etal-2016-cintil
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • Rd

    Rita de Carvalho

  • AQ

    Andreia Querido

  • MC

    Marisa Campos

  • RP

    Rita Valadas Pereira

  • JS

    João Silva

  • AB

    António Branco

Links