
Extracting Signs from Weakly Aligned Sign Language Corpora: A Study on LSF and LSM

Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion

DOI:10.63317/38kfot52b4dz

Abstract

This paper presents a framework for the automatic annotation of sign language data across different recording conditions, including original and interpreted content. The proposed approach integrates weak alignment, sign segmentation, and multiple instance learning with a contrastive loss. The resulting annotations are subsequently refined and filtered to enhance their reliability. Our method was applied to two historically related sign languages, French Sign Language (LSF) and Mexican Sign Language (LSM). This led to the creation of two signaries, comprising approximately 2k categories in LSF (25k occurrences) and 41 categories in LSM (1k occurrences). Both resources provide valuable support for future research in artificial intelligence and linguistics, particularly for comparative analyses between the two languages. A preliminary analysis is presented as part of this paper.

Details

Paper ID: lrec2026-ws-signlang-19
Pages: pp. 174-183
BibKey: delagarza-etal-2026-extracting
Editors: Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Johanna Mesch, Marc Schulder
Publisher: European Language Resources Association (ELRA)
ISSN: N/A
ISBN: N/A
Workshop: Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion
Location: Palma, Mallorca, Spain
Date: 11-16 May 2026

Authors

  • Lorena de la Garza
  • Julie Halbout
  • Julie Lascar
  • Niels Martinez
  • Arturo Curiel
  • Michèle Gouiffès
  • Annelies Braffort

Links