Back to Main Conference 2002
LREC 2002main
Bootstrapping Large Sense Tagged Corpora
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)
Abstract
The performance of Word Sense Disambiguation systems largely depends on the availability of sense tagged corpora. Since the semantic annotations are usually done by humans, the size of such corpora is limited to a handful of tagged texts. This paper proposes a generation algorithm that may be used to automatically create large sense tagged corpora. The approach is evaluated through comparative sense disambiguation experiments performed on data provided during the SENSEVAL-2