
Cross-lingual and Supervised Models for Morphosyntactic Annotation: a Comparison on Romanian

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/5bd4x3h3dqjx

Abstract

Because of the small size of Romanian corpora, the performance of a PoS tagger or a dependency parser trained with standard supervised methods falls far short of the performance achieved in most languages. We therefore apply state-of-the-art methods for cross-lingual transfer to Romanian tagging and parsing, from English and several Romance languages. We compare their performance with that of monolingual systems trained on sets of different sizes, and establish that training on a few sentences in the target language yields better results than transferring from large datasets in other languages.

Details

Paper ID
lrec2016-main-241
Pages
pp. 1520-1526
BibKey
aufrant-etal-2016-cross-lingual
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 to 28 May 2016

Authors

  • Lauriane Aufrant

  • Guillaume Wisniewski

  • François Yvon

Links