Back to Main Conference 2016
LREC 2016main

A Multi-domain Corpus of Swedish Word Sense Annotation

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/4ujpgsneayoa

Abstract

We describe the word sense annotation layer in \emph{Eukalyptus}, a freely available five-domain corpus of contemporary Swedish with several annotation layers. The annotation uses the SALDO lexicon to define the sense inventory, and allows word sense annotation of compound segments and multiword units. We give an overview of the new annotation tool developed for this project, and finally present an analysis of the inter-annotator agreement between two annotators.

Details

Paper ID
lrec2016-main-482
Pages
pp. 3019-3022
BibKey
johansson-etal-2016-multi
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • RJ

    Richard Johansson

  • YA

    Yvonne Adesam

  • GB

    Gerlof Bouma

  • KH

    Karin Hedberg

Links