Back to Main Conference 2022
LREC 2022main

Towards Latvian WordNet

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/5isjq2pcdcee

Abstract

In this paper we describe our current work on creating a WordNet for Latvian based on the principles of the Princeton WordNet. The chosen methodology for word sense definition and sense linking is based on corpus evidence and the existing Tezaurs.lv online dictionary, ensuring a foundation that fits the Latvian language usage and existing linguistic tradition. We cover a wide set of semantic relations, including gradation sets. Currently the dataset consists of 6432 words linked in 5528 synsets, out of which 2717 synsets are considered fully completed as they have all the outgoing semantic links annotated, annotated with corpus examples for each sense and links to the English Princeton WordNet.

Details

Paper ID
lrec2022-main-300
Pages
pp. 2808-2815
BibKey
paikens-etal-2022-towards
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • PP

    Peteris Paikens

  • MG

    Mikus Grasmanis

  • AK

    Agute Klints

  • IL

    Ilze Lokmane

  • LP

    Lauma Pretkalniņa

  • LR

    Laura Rituma

  • MS

    Madara Stāde

  • LS

    Laine Strankale

Links