Back to Main Conference 2026
LREC 2026main

In-Distribution Steering: Balancing Control and Coherence in Language Model Generation

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/4629fxavjicu

Abstract

Activation steering methods control large language model (LLM) behavior by modifying internal activations at inference time. However, most existing activation steering methods rely on a fixed steering strength, leading to either insufficient control or unadapted intervention that degrades text plausibility and coherence. We introduce In-Distribution Steering (IDS), a novel method that adapts steering strength based on the input data distribution in representation space. IDS dynamically adjusts interventions according to how far a given input lies within the distribution, enabling adaptive intervention and generation stability during text generation. Experiments demonstrate that IDS achieves strong accuracy on classification tasks while producing coherent text without collapse, making IDS particularly well suited for real-world applications.

Details

Paper ID
lrec2026-main-163
Pages
pp. 2076-2089
BibKey
vogels-etal-2026-distribution
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • AV

    Arthur Vogels

  • BW

    Benjamin Wong

  • YC

    Yann Choho

  • AB

    Annabelle Blangero

  • MB

    Milan Bhan

Links