SAINT: Multilingual Span-Level Interpretability for Sentiment Analysis

Proceedings of the SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL "Towards Inclusivity and Equality: Language Resources and Technologies for Under-Resourced and Endangered Languages

DOI:10.63317/2ph93cb2qi57

Abstract

We investigate multilingual sentiment analysis and interpretability across high- and low-resource languages, focusing on Amharic, English, German, and Hausa. Our study evaluates encoder-only transformer models for both sequence-level sentiment classification and token-level attribution using Captum. Additionally, we assess zero- and few-shot decoder-only models for sequence-level sentiment prediction. Our results show that few-shot decoder-only models outperform encoder-only models on token-level sentiment classification in most languages, with the exception of Hausa, where a multilingual encoder-based model leads. For sequence-level sentiment classification, encoder-only models generally achieve strong performance across most languages, but decoder-only models are highly competitive, and may even surpass encoders, in the high-resource settings (German, English) and low-resource scenarios, depending on the prompting strategy. These findings highlight the utility of combining fine-tuned transformer models with prompt- based large language models to build interpretable sentiment analysis systems across both low- and high-resource languages. The SAINT dataset, annotation guideline, and evaluation scripts can be found at https://github.com/uhh-hcds/SAINT.

Resources

Details

Paper ID

lrec2026-ws-sigul-06

Pages

pp. 54-65

DOI

10.63317/2ph93cb2qi57

BibKey

yimam-etal-2026-saint

Editors

Atul Kr. Ojha, Sakriani Sakti, Claudia Soria, Maite Melero, John P. McCrae, Constantine Lignos, Chao-Hong Liu, German Rigau Claramunt, Georg Rehm

Publisher

European Language Resources Association (ELRA)

ISSN

N/A

ISBN

N/A

Workshop

Proceedings of the SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL "Towards Inclusivity and Equality: Language Resources and Technologies for Under-Resourced and Endangered Languages

Location

Palma, Mallorca, Spain

Date

11 - 16 May 2026

Authors

SY
Seid Muhie Yimam
TB
Tadesse Destaw Belay
RG
Robert Geislinger
SM
Shamsuddeen Hassan Muhammad
AO
Adaeze Ngozi Ohuoba
SI
Sukairaj Hafiz Imam
AA
Abinew Ali Ayele
MS
Martin Semmann
CB
Chris Biemann
SS
Serge Sharoff

Links

URL

DOI