HomeLREC 2026WorkshopsNLP4ECOLOGYlrec2026-ws-nlp4ecology-08
Back to NLP4ECOLOGY 2026
LREC 2026workshop

Greench-v1: distilling SLMs on Greenwashing Detection

Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing

DOI:10.63317/2ngqn2gr78du

Abstract

Validating greenwashing claims in environmental, social, and governance (ESG) reports relies heavily on costly and inconsistent manual review. To address this, this paper introduces Greench-v1, a low-latency small language model (based on Qwen3-4B) that screens ESG text at the paragraph level. The model outputs a three-way classification (Greenwashing Alert, No Greenwashing, Not Relevant) paired with a concise, paragraph-grounded rationale to assist human auditors in triage and validation. The system was trained on a custom dataset of roughly 2,000 paragraphs, adapted from the ClimateBERT corpus. This dataset mitigates class imbalance through controlled paraphrasing of rare positive instances and uses GPT-4o to generate evidence-based justifications. Four training regimes were evaluated: (i) Hard distillation: Supervised fine-tuning on teacher-generated outputs. (ii) Soft distillation: Training the student to match the temperature-scaled logits of a domain-specialized Qwen3-14B teacher. (iii) Group Relative Policy Optimization (GRPO): Reward-based updates driven by exact-match alert generation. (iv) Hybrid GRPO: GRPO initialized from the hard-distilled checkpoint. Distillation and efficient policy optimization significantly improved performance over untuned baselines. Soft distillation and GRPO achieved the strongest results, increasing the "Greenwashing Alert" weighted F1-score by 36.7% and 49.0%, respectively, resulting in a deployable tool for screening large volumes of ESG narratives.

Details

Paper ID
lrec2026-ws-nlp4ecology-08
Pages
pp. 79-86
BibKey
raspanti-etal-2026-greench
Editors
Francesca Grasso, Valerio Basile, Cristina Bosco, Muhammad Okky Ibrohim, Maria Skeppstedt, Manfred Stede
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • FR

    Federico Raspanti

  • AB

    Alessandro Pietro Bardelli Bardelli

  • SS

    Simona Scala

  • İD

    İrem Demirtaş

  • MD

    Marilena Di Bari

  • MF

    Michele Filannino

Links