LREC 2026 workshop

Disfluencies and ASR Performance on Swedish Spontaneous Speech from the ‘Trip to Stockholm’ Discourse Narrative Task

Proceedings of the Sixth Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments in cooperation with the MENTAL.ai consortium

DOI:10.63317/3gqmxymvjgio

Abstract

Automatic Speech Recognition (ASR) offers a scalable and cost-efficient alternative to manual transcription and is becoming increasingly relevant in clinical contexts, particularly for the detection of cognitive decline and for mental health assessment. However, current ASR systems still struggle with spontaneous speech, particularly when processing disfluencies, pauses, and speaker variability, which often carry diagnostic value. This study evaluates state-of-the-art open ASR models targeting Swedish using recordings from the "Trip to Stockholm" discourse narrative task, which elicits ecologically valid, cognitively demanding speech. Recognition quality is assessed using various metrics, alongside an analysis of linguistic and technical sources of error focused on disfluencies. Our findings show that disfluency-related phenomena degrade recognition performance. Post-processing strategies may mitigate specific error patterns that emerge for filled pauses, word repetitions, and self-corrections. The results illustrate both the advances and the ongoing limitations of ASR for spontaneous Swedish speech, emphasizing the need for models explicitly trained, or fine-tuned, on disfluent data to ensure robustness in clinical and research applications.
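
As an illustration of the kind of evaluation the abstract describes, the following is a minimal sketch of word error rate (WER) computed before and after a simple disfluency-stripping step. It is not the authors' pipeline: the filler inventory `FILLED_PAUSES`, the helper names, and the example sentence are all assumptions made for illustration, and the abstract does not state which metrics or post-processing rules were actually used.

```python
import re

# Hypothetical filler inventory (assumption); the paper's own list is not given here.
FILLED_PAUSES = {"eh", "öh", "äh", "hm", "mm"}


def tokenize(text):
    """Lowercase the text and split it into word tokens, dropping punctuation."""
    return re.findall(r"\w+", text.lower())


def strip_disfluencies(tokens):
    """Remove filled pauses and collapse immediate word repetitions."""
    cleaned = []
    for tok in tokens:
        if tok in FILLED_PAUSES:
            continue  # drop filled pauses such as "eh"
        if cleaned and cleaned[-1] == tok:
            continue  # collapse "jag jag" into "jag"
        cleaned.append(tok)
    return cleaned


def wer(reference, hypothesis):
    """Word error rate: word-level edit distance divided by reference length."""
    if not reference:
        raise ValueError("reference must contain at least one word")
    prev = list(range(len(hypothesis) + 1))
    for i, ref_word in enumerate(reference, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hypothesis, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1] / len(reference)


# Invented example: a verbatim reference with a repetition and a filled pause,
# and an ASR hypothesis that silently omits both.
ref = tokenize("jag jag åkte eh till Stockholm")
hyp = tokenize("jag åkte till Stockholm")

raw_wer = wer(ref, hyp)
clean_wer = wer(strip_disfluencies(ref), strip_disfluencies(hyp))
print(f"raw WER: {raw_wer:.2f}, WER after stripping disfluencies: {clean_wer:.2f}")
```

Note that collapsing every immediate repetition also removes legitimate repeats, and that stripping disfluencies discards exactly the material the abstract describes as diagnostically valuable, so a normalization of this kind suits scoring more than clinical analysis.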

Details

Paper ID
lrec2026-ws-rapid6mentalai-03
Pages
pp. 24-33
BibKey
kokkinakis-etal-2026-disfluencies
Editors
Dimitrios Kokkinakis, Charalambos Themistocleous, Gaël Dias, Kathleen C. Fraser, Fredrik Öhman, Sebastião Pais
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Sixth Resources and ProcessIng of linguistic, para-linguistic and extra-linguistic Data from people with various forms of cognitive/psychiatric/developmental impairments in cooperation with the MENTAL.ai consortium
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Dimitrios Kokkinakis

  • Herbert Lange

  • Ricardo Muñoz Sánchez
