SimLex-999 for Modern Greek

Proceedings of the SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL "Towards Inclusivity and Equality: Language Resources and Technologies for Under-Resourced and Endangered Languages"

DOI: 10.63317/2ynm43iouxft

Abstract

Human judgements of word similarity have been a core benchmark for the intrinsic evaluation of word embedding models, and continue to be used for assessing the capabilities of large language models. While word similarity benchmarks have been collected for a range of languages, none existed for Greek. We develop a Modern Greek variant of the SimLex-999 word similarity dataset by gathering similarity judgements from 90 native speakers of Greek. We then use this as a benchmark for intrinsically evaluating several Greek language models.
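As a rough illustration of what intrinsic evaluation against a SimLex-style benchmark involves, the sketch below scores a set of word vectors by correlating their cosine similarities with human ratings. SimLex-999 results are conventionally reported as Spearman's rank correlation; the abstract does not state which metric this paper uses, so treat that choice as an assumption. The words, vectors, and ratings are hypothetical toy values, not data from the Greek dataset, and the function names are illustrative only.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def ranks(values):
    """1-based ranks in ascending order; tied values share their mean rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = mean_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length sequences."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))


def evaluate(pairs, vectors):
    """Return (rho, number of pairs scored); out-of-vocabulary pairs are skipped."""
    gold, predicted = [], []
    for word1, word2, rating in pairs:
        if word1 in vectors and word2 in vectors:
            gold.append(rating)
            predicted.append(cosine(vectors[word1], vectors[word2]))
    return spearman(gold, predicted), len(gold)


# Hypothetical toy data: placeholder words, 2-d vectors, ratings on a 0-10 scale.
vectors = {"a": [1.0, 0.0], "b": [0.9, 0.1], "c": [0.0, 1.0], "d": [0.5, 0.5]}
pairs = [("a", "b", 9.0), ("a", "d", 5.0), ("a", "c", 1.0), ("a", "missing", 4.0)]

rho, n_used = evaluate(pairs, vectors)
print(f"Spearman rho = {rho:.3f} over {n_used} of {len(pairs)} pairs")
```

Skipping pairs with out-of-vocabulary words is only one possible policy; reporting how many pairs were actually scored alongside the correlation makes results easier to compare across models.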

Details

Paper ID

lrec2026-ws-sigul-13

Pages

pp. 119-125

DOI

10.63317/2ynm43iouxft

BibKey

mylonadis-etal-2026-simlex

Editors

Atul Kr. Ojha, Sakriani Sakti, Claudia Soria, Maite Melero, John P. McCrae, Constantine Lignos, Chao-Hong Liu, German Rigau Claramunt, Georg Rehm

Publisher

European Language Resources Association (ELRA)

ISSN

N/A

ISBN

N/A

Workshop

Proceedings of the SIGUL 2026 Joint Workshop with ELE, EURALI, and DCLRL "Towards Inclusivity and Equality: Language Resources and Technologies for Under-Resourced and Endangered Languages"

Location

Palma, Mallorca, Spain

Date

11 - 16 May 2026

Authors

Leonidas Mylonadis
Jelke Bloem