HomeLREC 2026WorkshopsRESOURCEFULlrec2026-ws-resourceful-13
Back to RESOURCEFUL 2026
LREC 2026workshop

Progressing beyond Art Masterpieces or Touristic Clichés: how to assess your LLMs for cultural alignment?

The Fourth Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL 2026)

DOI:10.63317/37qvfzetoj3r

Abstract

Although the cultural (mis)alignment of Large Language Models (LLMs) has attracted increasing attention - often framed in terms of cultural bias - until recently there has been limited work on the design and development of datasets for cultural assessment. Here, we review existing approaches to such datasets and identify their main limitations. To address these issues, we propose design guidelines for annotators and report on the construction of a dataset built according to these principles. We further present a series of contrastive experiments conducted with this dataset. The results demonstrate that our design yields test sets with greater discriminative power, effectively distinguishing between models specialized for a given culture and those that are not, ceteris paribus.

Details

Paper ID
lrec2026-ws-resourceful-13
Pages
pp. 131-141
BibKey
branco-etal-2026-progressing
Editors
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
The Fourth Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL 2026)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • AB

    António Branco

  • JS

    João Ricardo Silva

  • NM

    Nuno Marques

  • LG

    Luis M. S. Gomes

  • RC

    Ricardo Campos

  • RS

    Raquel Sequeira

  • SN

    Sara Nerea

  • RS

    Rodrigo Silva

  • MM

    Miguel Marques

  • RD

    Rodrigo Duarte

  • AP

    Artur Putyato

  • DF

    Diogo Folques

  • TV

    Tiago Valente

Links