Back to WILDRE 2024
LREC-COLING 2024workshop

Multilingual Bias Detection and Mitigation for Indian Languages

Proceedings of the 7th Workshop on Indian Language Data: Resources and Evaluation

DOI:10.63317/3isg538ddv5d

Abstract

Lack of diverse perspectives causes neutrality bias in Wikipedia content leading to millions of worldwide readers getting exposed by potentially inaccurate information. Hence, neutrality bias detection and mitigation is a critical problem. Although previous studies have proposed effective solutions for English, no work exists for Indian languages. First, we contribute two large datasets, mWIKIBIAS and mWNC, covering 8 languages, for the bias detection and mitigation tasks respectively. Next, we investigate the effectiveness of popular multilingual Transformer-based models for the two tasks by modeling detection as a binary classification problem and mitigation as a style transfer problem. We make the code and data publicly available.

Details

Paper ID
lrec2024-ws-wildre-04
Pages
pp. 24-29
BibKey
maity-etal-2024-multilingual
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 7th Workshop on Indian Language Data: Resources and Evaluation
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • AM

    Ankita Maity

  • AS

    Anubhav Sharma

  • RD

    Rudra Dhar

  • TA

    Tushar Abhishek

  • MG

    Manish Gupta

  • VV

    Vasudeva Varma

Links