Back to Main Conference 2006
LREC 2006main
The SALSA Corpus: a German Corpus Resource for Lexical Semantics
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
This paper describes the SALSA corpus, a large German corpus manually annotated with manual role-semantic annotation, based on the syntactically annotated TIGER newspaper corpus. The first release, comprising about 20,000 annotated predicate instances (about half the TIGER corpus), is scheduled for mid-2006. In this paper we discuss the annotation framework (frame semantics) and its cross-lingual applicability, problems arising from exhaustive annotation, strategies for quality control, and possible applications.