Back to Main Conference 2012
LREC 2012main

MLSA — A Multi-layered Reference Corpus for German Sentiment Analysis

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/279fqb22ta9a

Abstract

In this paper, we describe MLSA, a publicly available multi-layered reference corpus for German-language sentiment analysis. The construction of the corpus is based on the manual annotation of 270 German-language sentences considering three different layers of granularity. The sentence-layer annotation, as the most coarse-grained annotation, focuses on aspects of objectivity, subjectivity and the overall polarity of the respective sentences. Layer 2 is concerned with polarity on the word- and phrase-level, annotating both subjective and factual language. The annotations on Layer 3 focus on the expression-level, denoting frames of private states such as objective and direct speech events. These three layers and their respective annotations are intended to be fully independent of each other. At the same time, exploring for and discovering interactions that may exist between different layers should also be possible. The reliability of the respective annotations was assessed using the average pairwise agreement and Fleiss' multi-rater measures. We believe that MLSA is a beneficial resource for sentiment analysis research, algorithms and applications that focus on the German language.

Details

Paper ID
lrec2012-main-013
Pages
pp. 3551-3556
BibKey
clematide-etal-2012-mlsa
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • SC

    Simon Clematide

  • SG

    Stefan Gindl

  • MK

    Manfred Klenner

  • SP

    Stefanos Petrakis

  • RR

    Robert Remus

  • JR

    Josef Ruppenhofer

  • UW

    Ulli Waltinger

  • MW

    Michael Wiegand

Links