HamleDT: To Parse or Not to Parse?
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)
Abstract
We propose HamleDT (HArmonized Multi-LanguagE Dependency Treebank). HamleDT is a compilation of existing dependency treebanks (or dependency conversions of other treebanks), transformed so that they all conform to the same annotation style. While the license terms prevent us from directly redistributing the corpora, most of them are easily acquirable for research purposes. What we provide instead is the software that normalizes tree structures in the data obtained by the user from their original providers.