Back to Main Conference 2014
LREC 2014main

The Extended DIRNDL Corpus as a Resource for Coreference and Bridging Resolution

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/52ho8ntgttv4

Abstract

DIRNDL is a spoken and written corpus based on German radio news, which features coreference and information-status annotation (including bridging anaphora and their antecedents), as well as prosodic information. We have recently extended DIRNDL with a fine-grained two-dimensional information status labeling scheme. We have also applied a state-of-the-art part-of-speech and morphology tagger to the corpus, as well as highly accurate constituency and dependency parsers. In the light of this development we believe that DIRNDL is an interesting resource for NLP researchers working on automatic coreference and bridging resolution. In order to enable and promote usage of the data, we make it available for download in an accessible tabular format, compatible with the formats used in the CoNLL and SemEval shared tasks on automatic coreference resolution.

Details

Paper ID
lrec2014-main-683
Pages
pp. 3222-3228
BibKey
bjorkelund-etal-2014-extended
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 31 May 2014

Authors

  • AB

    Anders Björkelund

  • KE

    Kerstin Eckart

  • AR

    Arndt Riester

  • NS

    Nadja Schauffler

  • KS

    Katrin Schweitzer

Links