Lexical Conditioning of Model’s Distribution through Uncertainty-gated Soft-Mixing of Probabilities

Proceedings of the Joint Workshop on Readability and Text Simplification (READIxTSAR) @ LREC 2026

Abstract

We present Uncertainty-Gated Lexical Decoding (UGLD), a decoding-time framework for fine-grained lexical control in Large Language Models (LLMs) that explicitly addresses the trade-off between controllability and fluency. UGLD adaptively scales intervention through an entropy-based gating mechanism derived from the model’s predictive distribution, activating control when uncertainty is high and limiting interference when predictions are confident. The method supports both promotion toward and against predefined vocabularies. We evaluate UGLD in Italian on two open-weight LLMs (ANITA 8B and Qwen 3 4B) across paraphrasing and free-text generation settings, considering Simple Vocabulary Conditioning and Jargon Reduction scenarios. Automatic evaluation shows consistent improvements in lexical coverage over standard decoding strategies, while human evaluation confirms that fluency is preserved under controlled intervention.

Resources

Details

Paper ID

lrec2026-ws-readixtsar-07

Pages

pp. 89-100

DOI

10.63317/5hgiz64xsj6s

BibKey

papucci-etal-2026-lexical

Editors

Matthew Shardlow, Thomas François, Raquel Amaro, Jorge Baptista, Rémi Cardon, Eugénio Ribeiro, Horacio Saggion, Regina Stodden, Amalia Todirascu, Rodrigo Wilkens

Publisher

European Language Resources Association (ELRA)

ISSN

N/A

ISBN

N/A

Workshop

Proceedings of the Joint Workshop on Readability and Text Simplification (READIxTSAR) @ LREC 2026

Location

Palma, Mallorca, Spain

Date

11 - 16 May 2026

Authors

MP
Michele Papucci
GV
Giulia Venturi
FD
Felice Dell'Orletta

Links

URL

DOI