LREC 2026 workshop

A Survey of the Digitisation of German Newspapers in interwar Lithuania (1918–1940)

Proceedings of the First Workshop on Creating Interoperable Corpora of Historical Newspapers

DOI:10.63317/2tcpjmdfqgis

Abstract

This paper presents a survey of the preservation and digitisation status of the German-language press published in interwar Lithuania (1918–1940). Produced within this newly established and ethnically diverse republic, which operated in accordance with the European Minority Protection Regime, German-language newspapers and other periodicals formed a relevant part of the country’s multilingual press. They represent an interesting yet underexplored resource for historical and linguistic research. The survey summarises bibliographic information and the results of earlier digitisation projects. It further addresses challenges for optical character recognition (OCR) of newspaper facsimiles. Although systematic digitisation remains future work, the paper identifies major challenges for OCR within this collection, in particular in relation to typographic variation.

Details

Paper ID
lrec2026-ws-pressmint-11
Pages
pp. 65-71
BibKey
plauinaityt-etal-2026-survey
Editors
Maciej Ogrodniczuk, Petya Osenova, Tanja Wissik
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the First Workshop on Creating Interoperable Corpora of Historical Newspapers
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Lina Plaušinaitytė

  • Heike Zinsmeister
