Terminology-Augmented Generation for Intangible Cultural Heritage: A Controlled LLM-Based Translation Framework

Proceedings of the 2nd Workshop on Evaluating Text Difficulty in a Multilingual Context (DeTermIt! 2026)

DOI:10.63317/43bmxwahuzmq

Abstract

This study examines the integration of a bilingual Italian–Spanish concept-oriented terminological resource into a controlled large language model (LLM) translation workflow within the domain of Campanian gastronomy. The termbase encodes structured conceptual, linguistic, and translational metadata, including grammatical information, translation strategies, and genre-sensitive usage recommendations. Through a local Model Context Protocol (MCP) architecture, the resource is dynamically connected to locally deployed LLMs, enabling the automatic identification and retrieval of relevant terminological units prior to generation. The system combines in-context terminological injection with deterministic post-processing enforcement: genre-specific policies are injected into the model prompt, and a rule-based post-processing layer then verifies and enforces surface-level terminological consistency in the output. Two open-weight models (Mistral 7B Instruct and Gemma3 4B) are evaluated across three conditions and three discursive genres on a dataset of authentic texts. The findings suggest that the combination of terminological injection and deterministic enforcement can improve terminological compliance in controlled, domain-specific settings, while also highlighting differences in instruction-following behavior across models and genres.
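
The two-stage design described in the abstract (terminological injection into the prompt, followed by rule-based enforcement on the output) can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: the `TermEntry` structure, the single toy termbase entry, the genre label `recipe`, and the function names are all hypothetical. The MCP server and the LLM call are left out, so the "model output" is simply a hand-written draft string.

```python
import re
from dataclasses import dataclass, field


@dataclass
class TermEntry:
    """Hypothetical termbase record; the real resource holds richer metadata."""
    source: str                    # Italian term
    preferred: dict[str, str]      # genre label -> preferred Spanish rendering
    variants: list[str] = field(default_factory=list)  # renderings to normalise


# Toy termbase with one illustrative entry (not data from the paper).
TERMBASE = [
    TermEntry(
        source="mozzarella di bufala",
        preferred={"recipe": "mozzarella de búfala"},
        variants=["queso mozzarella de búfala", "mozzarella de bufala"],
    ),
]


def retrieve_terms(source_text: str, genre: str, termbase=TERMBASE) -> list[TermEntry]:
    """Return entries whose source term occurs in the text and has a policy for the genre."""
    text = source_text.casefold()
    return [e for e in termbase if e.source.casefold() in text and genre in e.preferred]


def build_prompt(source_text: str, genre: str, entries: list[TermEntry]) -> str:
    """Inject the genre-specific term policies into the translation prompt."""
    rules = "\n".join(f'- "{e.source}" -> "{e.preferred[genre]}"' for e in entries)
    return (
        f"Translate the Italian text into Spanish (genre: {genre}).\n"
        f"Use these term equivalents exactly:\n{rules}\n\n"
        f"Text: {source_text}"
    )


def enforce_terms(output: str, genre: str, entries: list[TermEntry]) -> str:
    """Deterministically replace known variants with the preferred rendering."""
    for e in entries:
        target = e.preferred[genre]
        # Longest variants first, so a short variant cannot split a longer one.
        for variant in sorted(e.variants, key=len, reverse=True):
            pattern = r"(?<!\w)" + re.escape(variant) + r"(?!\w)"
            output = re.sub(pattern, target, output, flags=re.IGNORECASE)
    return output


source = "Aggiungere la mozzarella di bufala a cubetti."
entries = retrieve_terms(source, "recipe")
prompt = build_prompt(source, "recipe", entries)

# Stand-in for a model output that ignored the injected policy.
draft = "Añadir la mozzarella de bufala en dados."
final = enforce_terms(draft, "recipe", entries)
print(final)
```

In this sketch the post-processing step acts only on surface strings, which mirrors the "surface-level" scope stated in the abstract: it can normalise a known variant, but it cannot detect a term that the model omitted or paraphrased in an unlisted way.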

Details

Paper ID
lrec2026-ws-determit-08
Pages
pp. 74-82
BibKey
punzizarino-etal-2026-terminology
Editors
Giorgio Maria Di Nunzio, Federica Vezzani, Liana Ermakova, Hosein Azarbonyad, Jaap Kamps
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Evaluating Text Difficulty in a Multilingual Context (DeTermIt! 2026)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Wanda Punzi Zarino

  • Pilar Sánchez Gijón
