Back to Main Conference 2008
LREC 2008main
A Unified Database of Dependency Treebanks: Integrating, Quantifying & Evaluating Dependency Data
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Abstract
This paper describes a database of 11 dependency treebanks which were unified by means of a two-dimensional graph format. The format was evaluated with respect to storage-complexity on the one hand, and efficiency of data access on the other hand. An example of how the treebanks can be integrated within a unique interface is given by means of the DTDB interface.