Back to Main Conference 2022
LREC 2022main

NorDiaChange: Diachronic Semantic Change Dataset for Norwegian

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/5fqp6eqj32w8

Abstract

We describe NorDiaChange: the first diachronic semantic change dataset for Norwegian. NorDiaChange comprises two novel subsets, covering about 80 Norwegian nouns manually annotated with graded semantic change over time. Both datasets follow the same annotation procedure and can be used interchangeably as train and test splits for each other. NorDiaChange covers the time periods related to pre- and post-war events, oil and gas discovery in Norway, and technological developments. The annotation was done using the DURel framework and two large historical Norwegian corpora. NorDiaChange is published in full under a permissive licence, complete with raw annotation data and inferred diachronic word usage graphs (DWUGs).

Details

Paper ID
lrec2022-main-274
Pages
pp. 2563-2572
BibKey
kutuzov-etal-2022-nordiachange
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • AK

    Andrey Kutuzov

  • ST

    Samia Touileb

  • PM

    Petter Mæhlum

  • TE

    Tita Enstad

  • AW

    Alexandra Wittemann

Links