HomeLREC 2026WorkshopsNLP4ECOLOGYlrec2026-ws-nlp4ecology-03
Back to NLP4ECOLOGY 2026
LREC 2026workshop

Towards Empowering Consumers through Sentence-level Readability Scoring in German ESG Reports

Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing

DOI:10.63317/3jfqkwnxuso3

Abstract

With the ever-growing urgency of sustainability in the economy and society, and the massive stream of information that comes with it, consumers need reliable access to that information. To address this need, companies began publishing so called Environmental, Social, and Governance (ESG) reports, both voluntarily and forced by law. To serve the public, these reports must be addressed not only to financial experts but also to non-expert audiences. But are they written clearly enough? In this work, we extend an existing sentence-level dataset of German ESG reports with crowdsourced readability annotations. We find that, in general, native speakers perceive sentences in ESG reports as easy to read, but also that readability is subjective. We apply various readability scoring methods and evaluate them regarding their prediction error and correlation with human rankings. Our analysis shows that, while LLM prompting has potential for distinguishing clear from hard-to-read sentences, a small finetuned transformer predicts human readability with the lowest error. Averaging predictions of multiple models can slightly improve the performance at the cost of slower inference.

Details

Paper ID
lrec2026-ws-nlp4ecology-03
Pages
pp. 26-41
BibKey
schler-etal-2026-empowering
Editors
Francesca Grasso, Valerio Basile, Cristina Bosco, Muhammad Okky Ibrohim, Maria Skeppstedt, Manfred Stede
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Ecology, Environment, and Natural Language Processing
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • BS

    Benjamin Josef Schüßler

  • JP

    Jakob Prange

Links