Back to Main Conference 2022
LREC 2022main

RRGparbank: A Parallel Role and Reference Grammar Treebank

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/2bcyye6wbw6r

Abstract

This paper describes the first release of RRGparbank, a multilingual parallel treebank for Role and Reference Grammar (RRG) containing annotations of George Orwell’s novel 1984 and its translations. The release comprises the entire novel for English and a constructionally diverse and highly parallel sample (“seed”) for German, French and Russian. The paper gives an overview of annotation decisions that have been taken and describes the adopted treebanking methodology. Finally, as a possible application, a multilingual parser is trained on the treebank data. RRGparbank is one of the first resources to apply RRG to large amounts of real-world data. Furthermore, it enables comparative and typological corpus studies in RRG. And, finally, it creates new possibilities of data-driven NLP applications based on RRG.

Details

Paper ID
lrec2022-main-517
Pages
pp. 4833-4841
BibKey
bladier-etal-2022-rrgparbank
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • TB

    Tatiana Bladier

  • KE

    Kilian Evang

  • VG

    Valeria Generalova

  • ZG

    Zahra Ghane

  • LK

    Laura Kallmeyer

  • RM

    Robin Möllemann

  • NM

    Natalia Moors

  • RO

    Rainer Osswald

  • SP

    Simon Petitjean

Links