Back to Main Conference 2026
LREC 2026main

Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/43x6rqwpycuo

Abstract

Evaluations of Large Language Models (LLMs) on knowledge-intensive tasks and factual accuracy often focus on high-resource languages primarily because datasets for low-resource languages (LRLs) are scarce. In this paper, we present Uhura—a new benchmark that focuses on two tasks in six typologically-diverse African languages, created via human translation of existing English benchmarks. The first dataset, Uhura-ARC-Easy, is composed of multiple-choice science questions. The second, Uhura-TruthfulQA, is a safety benchmark testing the truthfulness of models on topics including health, law, finance, and politics. We highlight the challenges creating benchmarks with highly technical content for LRLs and outline mitigation strategies. Our evaluation reveals a significant performance gap between proprietary models such as GPT-4o and o1-preview, and Claude models, and open-source models like LLaMA and Gemma. Additionally, all models perform better in English than in African languages. These results indicate that LLMs struggle with answering scientific questions and are more prone to generating false claims in low-resource African languages. Our findings underscore the necessity for continuous improvement of multilingual LLM capabilities in LRL settings to ensure safe and reliable use in real-world contexts. We open-source the Uhura Benchmark and Uhura Platform to foster further research and development in NLP for LRLs.

Details

Paper ID
lrec2026-main-115
Pages
pp. 1485-1504
BibKey
bayes-etal-2026-uhura
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • EB

    Edward Thomas Bayes

  • IA

    Israel Abebe Azime

  • JA

    Jesujoba Alabi

  • JK

    Jonas Kgomo

  • TE

    Tyna Eloundou

  • EP

    Elizabeth Proehl

  • KC

    Kai Chen

  • IK

    Imaan Khadir

  • NE

    Naome A. Etori

  • SM

    Shamsuddeen Hassan Muhammad

  • CM

    Choice Mpanza

  • IT

    Igneciah Pocia IP Thete

  • DK

    Dietrich Klakow

  • DA

    David Ifeoluwa Adelani

Links