HomeLREC 2026WorkshopsNLP4ECOLOGYlrec2026-ws-nlp4ecology-02
Back to NLP4ECOLOGY 2026
LREC 2026workshop

Unsupervised GRI-TCFD Alignment with LLM-Assisted Validation for Climate Disclosure and Greenwashing Risk Analysis

Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing

DOI:10.63317/4pebkuscwua5

Abstract

Climate-related corporate disclosures play a central role in sustainable finance and regulatory supervision, but remain difficult to analyze due to their length, unstructured format, and strategic language. While existing NLP approaches have been applied to ESG scoring and greenwashing detection, most operate at the document level and lack explicit alignment with formal reporting standards. We propose a scalable paragraph-level framework for aligning sustainability disclosures with the Global Reporting Initiative (GRI) indicators and the Task Force on Climate-related Financial Disclosures (TCFD) pillars. Our approach combines weak supervision, climate-focused GRI-TCFD mapping, embedding-based semantic similarity, and LLM validation for climate detection. In parallel, we introduce a paragraph-level greenwashing proxy based on commitment intensity, claim specificity, and sentiment polarity. This proxy complements regulatory alignment by capturing linguistic signals associated with potentially symbolic climate communication. The resulting augmented dataset is used to fine-tune ClimateBERT models in both single-task and multi-task settings. Experimental results show that weakly supervised dataset augmentation improves robustness and generalization compared to purely manual training, with further gains in the multi-task configuration. By integrating regulatory semantics, domain-adapted language models, and scalable annotation strategies, this study advances standard-aligned climate disclosure analysis and provides tools directly relevant to climate-related financial risk assessment.

Details

Paper ID
lrec2026-ws-nlp4ecology-02
Pages
pp. 15-25
BibKey
mousaviananaraki-etal-2026-unsupervised
Editors
Francesca Grasso, Valerio Basile, Cristina Bosco, Muhammad Okky Ibrohim, Maria Skeppstedt, Manfred Stede
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • SM

    Seyed Alireza Mousavian Anaraki

  • DC

    Danilo Croce

  • RC

    Roberta Costa

  • LT

    Luigi Tiburzi

  • AC

    Armando Calabrese

  • RB

    Roberto Basili

Links