LREC 2026 Workshop

Phonologically-aware Automatic Speech Recognition Evaluation of Low-Resource Languages: The Case of Basque Dialects

Proceedings of the First Workshop on Dialects in NLP — A Resource Perspective

DOI:10.63317/262fznwr54us

Abstract

Automatic speech recognition models are typically trained on data from standard language varieties, and their performance degrades on non-standard dialectal speech. In this paper, we present the first evaluation of an automatic speech recognition system for Basque, a low-resource language, based on spontaneous broadcast speech with a high proportion of dialectal speech. The evaluation relies on a 140-hour, manually annotated proprietary corpus of television programs broadcast by Basque Radio Television, which includes dialect-level labels as well as standardized and pseudo-phonetic transcriptions. We find that recognition performance degrades significantly for dialectal speech compared to standard speech, for all dialects present in our corpus. We then provide a quantitative analysis of phonological phenomena based on single-word substitution errors and identify 52 recurrent phenomena, grouped into sound deletions, epentheses, and substitutions. We further show a modest but statistically significant correlation between the number of phonological phenomena in an utterance and its recognition error rate. Our findings highlight the limitations of dialect-agnostic evaluation and motivate linguistically informed, dialect-aware strategies for automatic speech recognition in low-resource and typologically diverse languages.

Details

Paper ID
lrec2026-ws-dialres-05
Pages
pp. 48-57
BibKey
souganidis-etal-2026-phonologically
Editors
Antonis Anastasopoulos, Stella Markantonatou, Angela Ralli, Marcos Zampieri, Stavros Bompolas, Vivian Stamou
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the First Workshop on Dialects in NLP — A Resource Perspective
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Christoforos Souganidis
  • Asier Herranz
  • Ibon Saratxaga
  • Eva Navas
  • Inma Hernaez
