Back to Main Conference 2026
LREC 2026main

Universal NER v2: Towards a Massively Multilingual Named Entity Recognition Benchmark

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/4qhcjikvgeda

Abstract

We present Universal NER (UNER) v2, a significant extension of the initial version released in 2024. UNER is a collaborative dataset for multilingual named-entity annotations, built to support research on NER methods in a cross-linguistic setting. UNER v2 adds 11 new datasets in 10 typologically varied languages to the resource, including multiple parallel evaluation benchmarks aligned with each other and other datasets in UNER v1, while maintaining the same annotation guidelines and high standards for inter-annotator agreement. We report detailed statistics for the dataset and benchmark UNER v2 using both encoder-based model architectures and LLMs.

Details

Paper ID
lrec2026-main-525
Pages
pp. 6609-6618
BibKey
blevins-etal-2026-universal
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • TB

    Terra Blevins

  • SM

    Stephen Mayhew

  • MS

    Marek Suppa

  • HG

    Hila Gonen

  • SM

    Shachar Mirkin

  • VP

    Vasile Pais

  • KD

    Kaja Dobrovoljc

  • VG

    Voula Giouli

  • JK

    Jun Kevin

  • EJ

    Eugene Jang

  • EK

    Eungseo Kim

  • JS

    Jeongyeon Seo

  • XG

    Xenophon Gialis

  • YP

    Yuval Pinter

Links