HomeLREC 2026WorkshopsLLMS4SSHlrec2026-ws-llms4ssh-11
Back to LLMS4SSH 2026
LREC 2026workshop

Benchmarking LLMs for Aspect-Based Sentiment Classification in Slovene Historical Periodicals

Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026

DOI:10.63317/22hvcbc23rts

Abstract

Historical newspapers present substantial challenges for computational sentiment analysis due to OCR noise, archaic linguistic features, and the absence of domain-specific labeled training data. This paper examines whether instruction-following LLMs can support targeted, mention-level sentiment inference in such conditions. We benchmark four instruction-following LLMs on a manually annotated sample of collective-identity mentions drawn from Slovene historical newspapers. The results provide a benchmark for targeted sentiment classification in OCR-degraded historical Slovene and offer an empirically grounded assessment of the capabilities and limitations of an instruction-tuned LLM in digital humanities research.

Details

Paper ID
lrec2026-ws-llms4ssh-11
Pages
pp. 103-113
BibKey
munda-etal-2026-benchmarking
Editors
Arturo Montejo-Raez, Cristina Grisot, Joanna Blochowiak, Nikola Ljubešić, Elena Battaner, German Rigau
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • TM

    Tina Munda

  • FD

    Filip Dobranić

  • Uroš Šmajdek

  • OP

    Oliver Pejić

  • CB

    Ciril Bohak

  • VG

    Vojko Gorjanc

  • DF

    Darja Fišer

Links