CoBra: A Compound Branching Resource for Nominal Triconstituent Compounds in English and German

Proceedings of the Ninth Workshop on Universal Dependencies (UDW 2026)

DOI:10.63317/4hbr9zq3ty8c

Abstract

We present CoBra, a resource of triconstituent nominal compounds in English and German. It addresses an understudied aspect of compound processing: research and resources in psycholinguistics and NLP have mostly focused on two-constituent compounds. The resource also covers both general and scientific language, allowing for a register-informed perspective on compounds. It provides syntactic and semantic annotation of compound structure, in particular of the branching direction (i.e. the internal embedding structure, the Compound Branching) and of the semantic relationship between constituents. Annotations are implemented using extensions of Universal Dependencies (UD) labels. To explore applications of the resource, we also conduct a pilot study investigating the relationship between semantic transparency and branching direction. Our results indicate that the two are indeed correlated. Overall, the resource contributes to a more detailed understanding of the structure and processing of morphologically complex words within the UD framework.
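As an illustration of what branching direction means for a three-constituent compound, the minimal sketch below encodes a compound as a nested binary bracketing and reads off whether it is left-branching, [[A B] C], or right-branching, [A [B C]]. The tuple encoding, the `branching` function, and the two example compounds are assumptions made for illustration only; they are not taken from CoBra and do not reflect its UD-based annotation scheme.

```python
from typing import Tuple, Union

# Hypothetical encoding: a triconstituent compound is a pair in which
# exactly one member is itself a pair of constituents.
LeftBranching = Tuple[Tuple[str, str], str]    # [[A B] C]
RightBranching = Tuple[str, Tuple[str, str]]   # [A [B C]]
Compound = Union[LeftBranching, RightBranching]


def branching(compound: Compound) -> str:
    """Return 'left' for [[A B] C] and 'right' for [A [B C]]."""
    first, second = compound
    if isinstance(first, tuple) and isinstance(second, str):
        return "left"
    if isinstance(first, str) and isinstance(second, tuple):
        return "right"
    raise ValueError("expected a binary bracketing of three constituents")


# Illustrative compounds (not drawn from the resource).
examples = [
    (("health", "care"), "reform"),    # [[health care] reform]
    ("plastic", ("water", "bottle")),  # [plastic [water bottle]]
]
labels = [branching(c) for c in examples]
```

In the first example the modifier is itself a compound, while in the second the head is; this is the distinction the branching annotation captures.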

Details

Paper ID: lrec2026-ws-udw-11
Pages: pp. 128-141
BibKey: schacht-etal-2026-cobra
Editors: N/A
Publisher: European Language Resources Association (ELRA)
ISSN: N/A
ISBN: N/A
Workshop: Proceedings of the Ninth Workshop on Universal Dependencies (UDW 2026)
Location: Palma, Mallorca, Spain
Date: 11-16 May 2026

Authors

  • Carmen Schacht
  • Isabell Landwehr
  • Diana Davidson
  • Konrad Grabowski
  • Magdalena Meiser
  • Sophia Wiedmann