
The Construction of the CORALSE Corpus, Now and Beyond: A Tool for Documenting Spanish Sign Language

Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion

DOI:10.63317/5aiop98n7wa4

Abstract

The main objective of this paper is to present the experience of building the CORALSE corpus and to discuss the challenges that arise when attempting to provide a comprehensive description of a sign language. To this end, we address the following questions, drawing on the data obtained in the completed phases of the CORALSE project as well as on the foundational principles guiding the project’s third phase. THE CORALSE CORPUS TODAY: How have we developed a linguistic corpus of sign language? What steps have we taken in developing the CORALSE corpus? Which informants have we recorded, and what criteria have guided their selection? THE CORALSE CORPUS IN THE FUTURE: Which (native) languages do we prioritise when selecting informants? How do the perspectives of reference signers, interpreters, educators, and psycholinguists contribute to a more complete understanding of a sign language? Corpus linguistics is understood as a set of methodologies designed to study language through collections of digitised texts. Its development over recent decades, initially driven by advances in computing and subsequently by the emergence of the internet, represents one of the most significant transformations in contemporary linguistic research. The projects “CORALSE: Annotated Inter-university Corpus of Spanish Sign Language” and “Textual Typology, Registers and Styles in Spanish Sign Language: New Data for the Expansion of the CORALSE Corpus” adopt a corpus linguistics approach to collect, analyse and describe a representative sample of Spanish Sign Language (LSE). We also reflect on the types of linguistic data that are truly necessary to document the actual use of Spanish Sign Language.

Details

Paper ID
lrec2026-ws-signlang-15
Pages
pp. 140-147
BibKey
fernndezsoneira-etal-2026-construction
Editors
Eleni Efthimiou, Stavroula-Evita Fotinea, Thomas Hanke, Julie A. Hochgesang, Johanna Mesch, Marc Schulder
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the LREC 2026 12th Workshop on the Representation and Processing of Sign Languages: Language in Motion
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Ana Fernández Soneira

  • María C. Bao-Fente

  • Rayco H. González-Montesino

  • Inmaculada C. Báez-Montero
