Back to Main Conference 2026
LREC 2026main

Text+: A National Hub Including Legacy Language Data

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/4vx5d59r6m29

Abstract

Text+ is the German distributed research data infrastructure for literary studies, linguistics, and spoken and written language. Its resources consist of contemporary and historical literary and media texts, deeply annotated material, transcripts of spoken and sign language, and original recordings. Text+ provides access to its resources according to the FAIR guidelines: Findable due to standard-conformant metadata, Accessible with single sign-on authentication, Interoperable via open data formats, and Reproducible through web services and extensive documentation. The 30+ partners of Text+ are archives, libraries, universities, and other research institutions. The partners are autonomous, and they differ in the amount of data and processing capabilities they provide. In this paper, we describe the hub architecture of Text+, which gives users a central and FAIR point of access to research data that continues to be distributed across the Text+ partner institutions. The architecture serves as a blueprint to evolving research infrastructures that aim at maintaining (and empowering) their research data contributors.

Details

Paper ID
lrec2026-main-654
Pages
pp. 8264-8275
BibKey
barth-etal-2026-text
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • FB

    Florian Barth

  • CD

    Christoph Draxler

  • JE

    Jennifer Ecker

  • SF

    Stefan Fischer

  • PG

    Philippe Genêt

  • AH

    Alina Hemmer

  • TL

    Timm Lehmberg

  • TT

    Thorsten Trippel

  • AW

    Andreas Witt

  • AZ

    Arden Zimmermann

  • CZ

    Claus Zinn

Links