
Forewarned Is Forearmed: When Non-Sequential Embedding Turns into an Anomaly Detector

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/58vxg7q9649q

Abstract

This paper offers an in-depth analysis of non-sequential multimodal sentence-level embeddings, with a particular focus on the SONAR model. We demonstrate that certain embedding dimensions are sensitive to perturbations and can serve as indicators of decoding anomalies. By leveraging the consistency between successive encoding and decoding steps, we build an accurate anomaly detector. We also explore modifying specific dimensions of interest in an attempt to correct the detected anomalies. This work underscores the importance of analyzing the embeddings themselves to improve the reliability of multimodal representations.
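
The encode/decode consistency idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation and does not use the SONAR API: the toy encoder and decoder, the function names, the use of cosine similarity, and the 0.9 threshold are all assumptions chosen for illustration. The sketch decodes an embedding, re-encodes the result, and flags the embedding when the two vectors disagree.

```python
import math
from typing import Callable, List, Sequence

LETTERS = "abc"  # hypothetical toy vocabulary, one embedding dimension per letter


def toy_encode(text: str) -> List[float]:
    """Toy stand-in for a sentence encoder: count each vocabulary letter."""
    return [float(text.count(ch)) for ch in LETTERS]


def toy_decode(embedding: Sequence[float]) -> str:
    """Toy stand-in for a decoder: emit each letter round(value) times.

    Negative values cannot be expressed as text, so they are clamped to zero.
    """
    return "".join(ch * max(0, round(v)) for ch, v in zip(LETTERS, embedding))


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors, defined as 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def round_trip_is_anomalous(
    embedding: Sequence[float],
    encode: Callable[[str], List[float]],
    decode: Callable[[Sequence[float]], str],
    threshold: float = 0.9,  # assumed value, not taken from the paper
) -> bool:
    """Flag an embedding whose decode-then-encode round trip drifts too far."""
    re_encoded = encode(decode(embedding))
    return cosine_similarity(embedding, re_encoded) < threshold


clean = toy_encode("aab")          # [2.0, 1.0, 0.0]
perturbed = [2.0, 1.0, -5.0]       # one dimension pushed out of range

print(round_trip_is_anomalous(clean, toy_encode, toy_decode))
print(round_trip_is_anomalous(perturbed, toy_encode, toy_decode))
```

With a real model, the two callables would be replaced by the model's own encoder and decoder, and the threshold would need to be calibrated on held-out data.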

Details

Paper ID
lrec2026-main-797
Pages
pp. 10150-10156
BibKey
allesiardo-etal-2026-forewarned
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 - 16 May 2026

Authors

  • Elys Allesiardo

  • Antoine Caubrière

  • Valentin Vielzeuf
