Back to Main Conference 2008
LREC 2008main

Comparing Italian parsers on a common Treebank: the EVALITA experience

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/4j35we3newji

Abstract

The EVALITA 2007 Parsing Task has been the first contest among parsing systems for Italian. It is the first attempt to compare the approaches and the results of the existing parsing systems specific for this language using a common treebank annotated using both a dependency and a constituency-based format. The development data set for this parsing competition was taken from the Turin University Treebank, which is annotated both in dependency and constituency format. The evaluation metrics were those standardly applied in CoNLL and PARSEVAL. The results of the parsing results are very promising and higher than the state-of-the-art for dependency parsing of Italian. An analysis of such results is provided, which takes into account other experiences in treebank-driven parsing for Italian and for other Romance languages (in particular, the CoNLL X & 2007 shared tasks for dependency parsing). It focuses on the characteristics of data sets, i.e. type of annotation and size, parsing paradigms and approaches applied also to languages other than Italian.

Details

Paper ID
lrec2008-main-356
Pages
N/A
BibKey
bosco-etal-2008-comparing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 30 May 2008

Authors

  • CB

    Cristina Bosco

  • AM

    Alessandro Mazzei

  • VL

    Vincenzo Lombardo

  • GA

    Giuseppe Attardi

  • AC

    Anna Corazza

  • AL

    Alberto Lavelli

  • LL

    Leonardo Lesmo

  • GS

    Giorgio Satta

  • MS

    Maria Simi

Links