LREC-COLING 2024 workshop

Interaction of Semantics and Morphology in Russian Word Vectors

Proceedings of the Workshop on Cognitive Aspects of the Lexicon @ LREC-COLING 2024

DOI:10.63317/3n3mbadiy2dq

Abstract

In this paper, we explore how morphological information can be extracted from fastText embeddings for Russian nouns. We investigate the negative effects of syncretism and propose ways of modifying the vectors that can help to find better representations for morphological functions and thus for out-of-vocabulary words. In particular, we look at the effect of analysing shift vectors instead of the original vectors, discuss various ways of finding base forms from which to create shift vectors, and show that using only high-frequency data is beneficial when looking for structure in the embeddings with respect to morphosyntactic functions.

Details

Paper ID
lrec2024-ws-cogalex-14
Pages
pp. 120-128
BibKey
zinova-etal-2024-interaction
Publisher
European Language Resources Association (ELRA) and ICCL
Workshop
Proceedings of the Workshop on Cognitive Aspects of the Lexicon @ LREC-COLING 2024
Date
20-25 May 2024

Authors

  • Yulia Zinova

  • Ruben van de Vijver

  • Anastasia Yablokova
