HomeLREC 2026WorkshopsLLMS4SSHlrec2026-ws-llms4ssh-22
Back to LLMS4SSH 2026
LREC 2026workshop

Charting the European LLM Benchmarking Landscape: A New Taxonomy and Registry

Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026

DOI:10.63317/4kixe3c9zmde

Abstract

While new benchmarks for large language models (LLMs) are being developed continuously to catch up with the growing capabilities of new models and AI in general, using and evaluating LLMs in non-English languages remains a poorly-charted landscape. We give a concise overview of recent developments in LLM benchmarking, and then propose a new taxonomy for the categorization of benchmarks that is tailored to multilingual or non-English use scenarios. We further propose a registry of benchmarks implementing the new categorization and documenting benchmarks with a rich set of metadescriptors. While still at a pilot stage, such a registry can lead to a more coordinated development of benchmarks for European languages. We conclude with a review of current trends and advocate for a higher language and culture sensitivity of evaluation methods.

Details

Paper ID
lrec2026-ws-llms4ssh-22
Pages
pp. 205-217
BibKey
vintar-etal-2026-charting
Editors
Arturo Montejo-Raez, Cristina Grisot, Joanna Blochowiak, Nikola Ljubešić, Elena Battaner, German Rigau
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of Shaping Multilingual, Multimodal AI for the Social Sciences and Humanities (LLMs4SSH) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • SV

    Spela Vintar

  • MB

    Mojca Brglez

  • TK

    Taja Kuzman Pungeršek

  • NL

    Nikola Ljubešić

Links