HomeLREC 2026WorkshopsSIGNLANGlrec2026-ws-signlang-09
Back to SIGNLANG 2026
LREC 2026workshop

A Sacred Bird Called the Phoenix. Auditing the most-used Parallel Corpus for German Sign Language Recognition and Translation

Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion

DOI:10.63317/3xftynuw7i5c

Abstract

This paper presents an empirical audit of the widely used RWTH-PHOENIX-2014T corpus, examining its suitability as a benchmark for sign language recognition and translation. Through human annotation of the training set and extensive sign-to-text back translation of the test set, we provide detailed statistics that indicate substantial quality issues, including information loss and lexical errors. Automatic scores comparing human sign-to-text back translations to the original speech transcribed references are remarkably low, suggesting strong translationese effects and substantial paraphrasing, revealing limitations of lexical metrics in adequately scoring translation quality. Replacing the original speech-transcribed references with human sign-to-text back translations while scoring existing sign language translation systems reveals the lack of robustness of system evaluation with lexical metrics against this test set. Our findings highlight risks associated with relying on this corpus for model evaluation and call for more rigorous, linguistically grounded evaluation practices in sign language technology research. The back-translated test set and error annotations are made publicly available.

Details

Paper ID
lrec2026-ws-signlang-09
Pages
pp. 80-92
BibKey
czehmann-etal-2026-sacred
Editors
Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Johanna Mesch, Marc Schulder
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • VC

    Vera Czehmann

  • SY

    Shakib Yazdani

  • YH

    Yasser Hamidullah

  • FN

    Fabrizio Nunnari

  • EA

    Eleftherios Avramidis

Links