HomeLREC 2020WorkshopsCLLRDlrec2020-ws-cllrd-6
Back to CLLRD 2020
LREC 2020workshop

The INCOMSLAV Platform: Experimental Website with Integrated Methods for Measuring Linguistic Distances and Asymmetries in Receptive Multilingualism

Proceedings of the LREC 2020 Workshop on "Citizen Linguistics in Language Resource Development"

DOI:10.63317/3b4gjtuem3h5

Abstract

We report on a web-based resource for conducting intercomprehension experiments with native speakers of Slavic languages and present our methods for measuring linguistic distances and asymmetries in receptive multilingualism. Through a website which serves as a platform for online testing, a large number of participants with different linguistic backgrounds can be targeted. A statistical language model is used to measure information density and to gauge how language users master various degrees of (un)intelligibilty. The key idea is that intercomprehension should be better when the model adapted for understanding the unknown language exhibits relatively low average distance and surprisal. All obtained intelligibility scores together with distance and asymmetry measures for the different language pairs and processing directions are made available as an integrated online resource in the form of a Slavic intercomprehension matrix (SlavMatrix).

Details

Paper ID
lrec2020-ws-cllrd-6
Pages
pp. 40-48
BibKey
stenger-etal-2020-incomslav
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the LREC 2020 Workshop on "Citizen Linguistics in Language Resource Development"
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • IS

    Irina Stenger

  • KJ

    Klara Jagrova

  • TA

    Tania Avgustinova

Links