HomeLREC 2026WorkshopsLT4HALAlrec2026-ws-lt4hala-09
Back to LT4HALA 2026
LREC 2026workshop

Domain-Aware Error Correction for Citation NER in Medieval Hebrew Responsa

Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026

DOI:10.63317/5euewaq3i8b5

Abstract

Citation identification in historical and ancient texts poses challenges that extend beyond surface-level pattern recognition, including implicit references, morphological fusion, and discourse-driven ambiguity. In this work, we address citation Named Entity Recognition (NER) in medieval Hebrew Responsa literature using a modular, LLM-based correction pipeline. Rather than treating large language models as end-to-end predictors, we leverage them as structured components: an initial prompt-based expert tagger, complementary LLM judges for systematic error detection, and domain-aware correction grounded in philological regularities. Our approach requires no end-to-end fine-tuning and only minimal labeled supervision (a small validation set for training a lightweight error-detection classifier), narrowing the performance gap to strong supervised models trained on domain-specific data. The results suggest that explicit error handling and interpretability-driven design offer a promising direction for historical NLP in low-resource settings.

Details

Paper ID
lrec2026-ws-lt4hala-09
Pages
pp. 96-105
BibKey
liebeskind-etal-2026-domain
Editors
Rachele Sprugnoli, Marco Passarotti
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • SL

    Shmuel LIebeskind

  • MZ

    Maayan Zhitomirsky-Geffet

  • BK

    Binyamin Katzoff

  • NB

    Nati Ben-Gigi

  • JS

    Jonathan Schler

Links