Back to Main Conference 2018
LREC 2018main

Training and Adapting Multilingual NMT for Less-resourced and Morphologically Rich Languages

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/2xb64bmrbc6y

Abstract

In this paper, we present results of employing multilingual and multi-way neural machine translation approaches for morphologically rich languages, such as Estonian and Russian. We experiment with different NMT architectures that allow achieving state-of-the-art translation quality and compare the multi-way model performance to one-way model performance. We report improvements of up to +3.27 BLEU points over our baseline results, when using a multi-way model trained using the transformer network architecture. We also provide open-source scripts used for shuffling and combining multiple parallel datasets for training of the multilingual systems.

Details

Paper ID
lrec2018-main-595
Pages
N/A
BibKey
rikters-etal-2018-training
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • MR

    Matīss Rikters

  • MP

    Mārcis Pinnis

  • RK

    Rihards Krišlauks

Links