Back to Main Conference 2026
LREC 2026main

MUNIChus: MUltilingual News Image Captioning Benchmark

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/3cu2uxnphh3g

Abstract

The goal of news image captioning is to generate captions by integrating news article content with corresponding images, highlighting the relationship between textual context and visual elements. The majority of research on news image captioning focuses on English, primarily because datasets in other languages are scarce. To address this limitation, we release the first multilingual news image captioning benchmark, MUNIChus, comprising 9 languages, including several low-resource languages such as Sinhala and Urdu. We evaluate various state-of-the-art neural news image captioning models on MUNIChus and find that news image captioning remains challenging. We also make MUNIChus publicly available as a public leaderboard with over 20 models already benchmarked. We hope that MUNIChus will enable further advancements in developing and evaluating multilingual news image captioning models.

Details

Paper ID
lrec2026-main-708
Pages
pp. 9008-9017
BibKey
chen-etal-2026-munichus
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • YC

    Yuji Chen

  • AP

    Alistair Plum

  • HH

    Hansi Hettiarachchi

  • DK

    Diptesh Kanojia

  • SB

    Saroj Basnet

  • MZ

    Marcos Zampieri

  • TR

    Tharindu Ranasinghe

Links