Back to PARLACLARIN 2024
LREC-COLING 2024workshop

Automated Emotion Annotation of Finnish Parliamentary Speeches Using GPT-4

Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024

DOI:10.63317/3c7puigbtnjv

Abstract

In this paper, we test the efficacy of using GPT-4 to annotate a dataset that is the used to train a BERT classifier for emotion analysis. Manual data annotation is often a laborious and expensive task and emotion annotation, specifically, has proved difficult even for expert annotators. We show that using GPT-4 can produce equally good results as doing data annotation manually while saving a lot of time and money. We train a BERT classifier on our automatically annotated dataset and get results that outperform a BERT classifier that is trained on machine translated data. Our paper shows how Large Language Models can be used to work with and analyse parliamentary corpora.

Details

Paper ID
lrec2024-ws-parlaclarin-11
Pages
pp. 70-76
BibKey
tarkka-etal-2024-automated
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • OT

    Otto Tarkka

  • JK

    Jaakko Koljonen

  • MK

    Markus Korhonen

  • JL

    Juuso Laine

  • KM

    Kristian Martiskainen

  • KE

    Kimmo Elo

  • VL

    Veronika Laippala

Links