Back to Main Conference 2008
LREC 2008main
Parallel Multi-Theory Annotations of Syntactic Structure
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Abstract
We present an approach to creating a treebank of sentences using multiple notations or linguistic theories simultaneously. We illustrate the method by annotating sentences from the Penn Treebank II in three different theories in parallel: the original PTB notation, a Functional Dependency Grammar notation, and a Government and Binding style notation. Sentences annotated with all of these theories are represented in XML as a directed acyclic graph where nodes and edges may carry extra information depending on the theory encoded.