Back to Main Conference 2006
LREC 2006main
Talbanken05: A Swedish Treebank with Phrase Structure and Dependency Annotation
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
We introduce Talbanken05, a Swedish treebank based on a syntactically annotated corpus from the 1970s, Talbanken76, converted to modern formats. The treebank is available in three different formats, besides the original one: two versions of phrase structure annotation and one dependency-based annotation, all of which are encoded in XML. In this paper, we describe the conversion process and exemplify the available formats. The treebank is freely available for research and educational purposes.