LREC-COLING 2024 (main conference)

Universal Dependencies: Extensions for Modern and Historical German

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/2apib3q8yd39

Abstract

In this paper, we present extensions of the Universal Dependencies (UD) scheme for modern and historical German. The extensions relate in part to fundamental distinctions, such as those between different kinds of arguments and modifiers. We illustrate the extensions with examples from Middle High German (MHG) data and discuss a number of MHG-specific constructions. So far, we have annotated an MHG corpus of almost 29K tokens using this scheme; to our knowledge, it is the first UD treebank for Middle High German. Inter-annotator agreement is very high, with the annotators achieving a score of α = 0.85. A statistical analysis of the annotations shows some interesting differences in the distribution of labels between modern and historical German.

Details

Paper ID
lrec2024-main-1485
Pages
pp. 17101-17111
BibKey
dipper-etal-2024-universal
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20-25 May 2024

Authors

  • Stefanie Dipper
  • Cora Haiber
  • Anna Maria Schröter
  • Alexandra Wiemann
  • Maike Brinkschulte

Links