Back to Main Conference 2022
LREC 2022main

Cyrillic-MNIST: a Cyrillic Version of the MNIST Dataset

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/5q5sicj588ca

Abstract

This paper presents a new handwritten dataset, Cyrillic-MNIST, a Cyrillic version of the MNIST dataset, comprising of 121,234 samples of 42 Cyrillic letters. The performance of Cyrillic-MNIST is evaluated using standard deep learning approaches and is compared to the Extended MNIST (EMNIST) dataset. The dataset is available at https://github.com/bolattleubayev/cmnist

Details

Paper ID
lrec2022-main-510
Pages
pp. 4767-4773
BibKey
tleubayev-etal-2022-cyrillic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • BT

    Bolat Tleubayev

  • ZZ

    Zhanel Zhexenova

  • KK

    Kenessary Koishybay

  • AS

    Anara Sandygulova

Links