
Speaker Normalization via Voice Conversion Reveals a Human-Machine Dissociation in Dialect Classification

Proceedings of the First Workshop on Dialects in NLP — A Resource Perspective

DOI:10.63317/3sqk7nxsikhp

Abstract

This study evaluates whether Retrieval-based Voice Conversion (RVC) can be used to normalize speaker-specific variability while preserving dialect-relevant acoustic cues, and what the response of human and machine systems to this manipulation reveals about the architecture of dialect recognition. In two perception experiments, speech samples from nine German dialect regions were presented either in their original form or after conversion to a single target speaker. We compared overall accuracy, confusion structures, item-level response distributions, and the interaction between listener origin and target dialect across conditions. Human classification remained stable under voice conversion. Accuracy did not differ between conditions, confusion matrices were highly correlated, and item-level divergences were minimal. The interaction between listener origin and target dialect—reflecting systematic regional bias—remained invariant. These findings indicate that RVC does not distort perceptually relevant dialectal cues and that human dialect recognition is robust to speaker normalization. In contrast, we evaluated a deep learning model under matched conditions: model accuracy improved significantly under RVC, while human performance remained unchanged. This dissociation reframes RVC as an experimental probe for investigating the divergence between human and machine speech processing, suggesting that this divergence is rooted in fundamentally different representational architectures.
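As an illustration of the confusion-structure comparison mentioned in the abstract, the following is a minimal sketch of how two conditions (original vs. voice-converted) might be compared by accuracy and by the correlation of their confusion matrices. It is not the authors' analysis code: the helper names, the row normalization, the use of Pearson correlation over flattened cells, and the three-region toy data are all assumptions made for this example.

```python
import numpy as np


def confusion_matrix(true_labels, predicted_labels, n_classes):
    """Count matrix: rows are true dialect regions, columns are responses."""
    counts = np.zeros((n_classes, n_classes), dtype=float)
    for t, p in zip(true_labels, predicted_labels):
        counts[t, p] += 1
    return counts


def row_normalize(counts):
    """Convert counts to per-region response proportions."""
    row_sums = counts.sum(axis=1, keepdims=True)
    # Rows with no trials stay at zero instead of dividing by zero.
    return np.divide(counts, row_sums, out=np.zeros_like(counts), where=row_sums > 0)


def matrix_correlation(a, b):
    """Pearson correlation between the flattened cells of two matrices."""
    return float(np.corrcoef(a.ravel(), b.ravel())[0, 1])


def accuracy(true_labels, predicted_labels):
    """Proportion of responses that match the true region."""
    return float(np.mean(np.array(true_labels) == np.array(predicted_labels)))


# Hypothetical toy data: 3 regions (the study uses 9), 2 trials per region.
true_labels = [0, 0, 1, 1, 2, 2]
original_responses = [0, 1, 1, 1, 2, 0]   # responses to unmodified speech
converted_responses = [0, 1, 1, 1, 2, 2]  # responses to voice-converted speech

cm_original = row_normalize(confusion_matrix(true_labels, original_responses, 3))
cm_converted = row_normalize(confusion_matrix(true_labels, converted_responses, 3))

accuracy_original = accuracy(true_labels, original_responses)
accuracy_converted = accuracy(true_labels, converted_responses)
similarity = matrix_correlation(cm_original, cm_converted)

print("accuracy (original): ", accuracy_original)
print("accuracy (converted):", accuracy_converted)
print("confusion-matrix correlation:", similarity)
```

A high correlation between the two matrices would indicate that the same regions are confused with one another in both conditions, which is the sense in which the abstract reports stable confusion structures for human listeners. Whether the paper normalizes rows or uses a different similarity measure is not stated on this page.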

Details

Paper ID
lrec2026-ws-dialres-18
Pages
pp. 177-187
BibKey
kleen-etal-2026-speaker
Editors
Antonis Anastasopoulos, Stella Markantonatou, Angela Ralli, Marcos Zampieri, Stavros Bompolas, Vivian Stamou
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the First Workshop on Dialects in NLP — A Resource Perspective
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Caroline Kleen
  • Lea Fischbach
  • Akbar Karimi
  • Lucie Flek
  • Alfred Lameli
