HomeLREC 2026WorkshopsIAAIlrec2026-ws-iaai-05
Back to IAAI 2026
LREC 2026workshop

Evaluating LLMs for Detecting Demographic-Targeted Social Bias: A Comprehensive Benchmark Study

Proceedings of the Second Workshop of Identity Aware AI

DOI:10.63317/3jejtg2hfj3t

Abstract

Large-scale web-scraped text corpora used to train general-purpose AI models often contain harmful demographic-targeted social biases, creating a regulatory need for data auditing and developing scalable bias-detection methods. Although prior work has investigated biases in text datasets and related detection methods, these studies remain narrow in scope. They typically focus on a single content type (e.g., hate speech), cover limited demographic axes, overlook biases affecting multiple demographics simultaneously, and analyze limited techniques. Consequently, practitioners lack a holistic understanding of the strengths and limitations of recent large language models (LLMs) for automated bias detection. In this study, we conduct a comprehensive benchmark study on English texts to assess the ability of LLMs in detecting demographic-targeted social biases. To align with regulatory requirements, we frame bias detection as a multi-label task of detecting targeted identities using a demographic-focused taxonomy. We then systematically evaluate models across scales and techniques, including prompting, in-context learning, and fine-tuning. Using twelve datasets spanning diverse content types and demographics, our study demonstrates the promise of fine-tuned smaller models for scalable detection. However, our analyses also expose persistent gaps across identity axes and multi-demographic targeted biases, underscoring the need for more effective and scalable detection frameworks.

Details

Paper ID
lrec2026-ws-iaai-05
Pages
pp. 47-65
BibKey
majumdar-etal-2026-evaluating
Editors
A Pranav, Valerio Basile, Neele Falk, David Jurgens, Gabriella Lapesa, Anne Lauscher, Soda Marem Lo
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Second Workshop of Identity Aware AI
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • AM

    Ayan Majumdar

  • FC

    Feihao Chen

  • JL

    Jinghui Li

  • XW

    Xiaozhen Wang

Links