Back to Main Conference 2018
LREC 2018main

Cheating a Parser to Death: Data-driven Cross-Treebank Annotation Transfer

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/4wv3nqvmyrmk

Abstract

We present an efficient and accurate method for transferring annotations between two different treebanks of the same language. This method led to the creation of a new instance of the French Treebank (Abeillé et al., 2003), which follows the Universal Dependency annotation scheme and which was proposed to the participants of the CoNLL 2017 Universal Dependency parsing shared task (Zeman et al., 2017). Strong results from an evaluation on our gold standard (94.75% of LAS, 99.40% UAS on the test set) demonstrate the quality of this new annotated data set and validate our approach.

Details

Paper ID
lrec2018-main-718
Pages
N/A
BibKey
seddah-etal-2018-cheating
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • DS

    Djamé Seddah

  • Ed

    Eric de la Clergerie

  • BS

    Benoît Sagot

  • HM

    Héctor Martínez Alonso

  • MC

    Marie Candito

Links