Back to Main Conference 2026
LREC 2026main

ESG-QA: Building a Dataset for Question Answering on Environmental, Social, and Governance Pillars

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/23kmozdtq6yp

Abstract

Environmental, Social, and Governance (ESG) factors are becoming increasingly central to corporate accountability and sustainable development. However, benchmarks for evaluating large language models (LLMs) in this domain remain scarce. To alleviate this gap, we present ESG-QA, a dataset of 87,261 question–answer–context triplets spanning the three ESG pillars. ESG-QA was built using an LLM-based Question Answer (QA) generation pipeline, enhanced through rule-based and semantic filtering, and validated by human inspection, enabling both abstractive QA and retrieval-augmented setups. We benchmark three open-weight LLM families (Llama-3, Gemma-3, and Qwen-3) across multiple dimensions, including correctness, environmental impact, and readability. Results show that Qwen-3 with retrieval achieves the highest absolute QA performance, while Gemma-3 provides the strongest overall balance between correctness, efficiency, and clarity. By releasing ESG-QA and its generation framework, this work establishes a comprehensive benchmark for advancing ESG-oriented QA and promoting more transparent and responsible AI evaluation.

Details

Paper ID
lrec2026-main-420
Pages
pp. 5377-5388
BibKey
assis-etal-2026-esg
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • GA

    Gabriel Assis

  • AS

    Ayrton Surica

  • PK

    Pedro Kroll

  • GM

    Gabriela Aires Mendes

  • DR

    Darian Rabbani

  • EB

    Edson Bollis

  • LP

    Lucas Francisco Amaral Orosco Pellicer

  • AP

    Aline Paes

Links