LREC 2022 (Main Conference)
Cyrillic-MNIST: a Cyrillic Version of the MNIST Dataset
Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)
Abstract
This paper presents Cyrillic-MNIST, a new handwritten dataset and a Cyrillic counterpart to the MNIST dataset, comprising 121,234 samples of 42 Cyrillic letters. Performance on Cyrillic-MNIST is evaluated using standard deep learning approaches and compared with performance on the Extended MNIST (EMNIST) dataset. The dataset is available at https://github.com/bolattleubayev/cmnist