HomeLREC 2026WorkshopsLT4HALAlrec2026-ws-lt4hala-50
Back to LT4HALA 2026
LREC 2026workshop

Neural Machine Translation for Coptic-French: Strategies for Low-Resource Ancient Languages

Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026

DOI:10.63317/5asa4khfm6hq

Abstract

This paper presents the first systematic study of strategies for translating Coptic into French. Our comprehensive pipeline systematically evaluates: pivot versus direct translation, the impact of pre-training, the benefits of multi-version fine-tuning, and model robustness to noise. Utilizing aligned biblical corpora, we demonstrate that fine-tuning with a stylistically-varied and noise-aware training corpus significantly enhances translation quality. Our findings provide crucial practical insights for developing translation tools for historical languages in general.

Details

Paper ID
lrec2026-ws-lt4hala-50
Pages
pp. 482-490
BibKey
chaoui-etal-2026-neural
Editors
Rachele Sprugnoli, Marco Passarotti
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • NC

    Nasma Chaoui

  • RK

    Richard Khoury

Links