Beyond Fake News Detection: A Community-based Study of the Multicultural Nature of Information Disorder
Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Abstract
Recognizing disinformation is a challenging task for humans and AI systems. News can be false, misleading, or harmful, and its interpretation often depends on the cultural context of the audience. However, existing datasets rarely account for these contextual and cultural differences, as they are typically not designed from the perspective of news consumers. To address this gap, in this paper, we present the Information Disorder (InDor) corpus, a multilingual dataset of news articles in English, Farsi, Italian, and Russian, annotated for information disorder detection and explanation. The corpus was developed through a participatory process involving contributors from diverse cultural and professional backgrounds, who engaged in data collection, annotation, and evaluation of Large Language Model (LLM) performance on the task. Our findings highlight that false and manipulated news manifest differently across cultural settings, and that current LLMs fail to adequately capture this complexity. This underscores the need for culturally aware computational approaches in the study of information disorder.
Details
Authors
- SG
Sara Gemelli
- GC
Giulia Di Cristina
- YZ
Yiran Zhang
- MH
Md Azizul Hoque
- AS
Alberto De La Torre Solís
- ME
Mohamad Mojtaba Behboudi Eshkiki
- NE
Nikolai Efimov
- ME
Mariia Everstova
- CC
Caterina Maria Cappello
- MJ
Maziar Kianimoghadam Jouneghani
- PL
Payam Latifi
- YM
Yashar Mahboudi
- FM
Farzaneh Mohseni
- DP
Dario Placenti
- TC
Tommaso Caselli
- MS
Manuela Sanguinetti
- AS
Aurora Scarpellini
- CZ
Chiara Zanchi
- UN
Usman Naseem
- MS
Marco Antonio Stranisci
- SF
Simona Frenda