LREC 2022 main

A Free/Open-Source Morphological Analyser and Generator for Sakha

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4d4eueaga8mi

Abstract

We present, to our knowledge, the first published morphological analyser and generator for Sakha, a marginalised language of Siberia. The transducer, developed using HFST, has coverage well above 90% and high precision. In developing the analyser, we have expanded linguistic knowledge about Sakha and developed strategies for handling complex grammatical patterns. The transducer is already being used in downstream tasks, including computer-assisted language learning applications for language maintenance and computational linguistics shared tasks.

Details

Paper ID
lrec2022-main-550
Pages
pp. 5137-5142
BibKey
ivanova-etal-2022-free
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 to 25 June 2022

Authors

  • Sardana Ivanova

  • Jonathan Washington

  • Francis Tyers

Links