LREC-COLING 2024 workshop

Multilingual Power and Ideology identification in the Parliament: a reference dataset and simple baselines

Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024

DOI:10.63317/2r35rotny5et

Abstract

We introduce a dataset for political orientation and power position identification. The dataset is derived from ParlaMint, a set of comparable corpora of transcribed parliamentary speeches from 29 national and regional parliaments. We describe the dataset, explain the reasoning behind some of the choices made during its creation, and present statistics on it. Using a simple classifier, we also report baseline results on two tasks: predicting political orientation on the left-to-right axis, and power position identification, i.e., distinguishing speeches delivered by members of governing coalition parties from those delivered by members of opposition parties.

Details

Paper ID
lrec2024-ws-parlaclarin-14
Pages
pp. 94-100
BibKey
coltekin-etal-2024-multilingual
Publisher
European Language Resources Association (ELRA) and ICCL
Workshop
Proceedings of the IV Workshop on Creating, Analysing, and Increasing Accessibility of Parliamentary Corpora (ParlaCLARIN) @ LREC-COLING 2024
Date
20-25 May 2024

Authors

  • Çağrı Çöltekin
  • Matyáš Kopp
  • Katja Meden
  • Vaidas Morkevicius
  • Nikola Ljubešić
  • Tomaž Erjavec

Links