Back to Main Conference 2018
LREC 2018main

SB-CH: A Swiss German Corpus with Sentiment Annotations

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/59z46sa6988z

Abstract

We present the SB-CH corpus, a novel Swiss German corpus with annotations for sentiment analysis. It consists of more than 200,000 phrases (approx. 1 Mio tokens) from Facebook comments and online chats. Additionally, we provide sentiment annotations for almost 2000 Swiss German phrases. We describe the methodologies used in the collection and annotation of the data, and provide the first baseline results for Swiss German sentiment analysis.

Details

Paper ID
lrec2018-main-372
Pages
N/A
BibKey
grubenmann-etal-2018-sb
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • RG

    Ralf Grubenmann

  • DT

    Don Tuggener

  • Pv

    Pius von Däniken

  • JD

    Jan Deriu

  • MC

    Mark Cieliebak

Links