Back to Main Conference 2016
LREC 2016main

Assessing the Prosody of Non-Native Speakers of English: Measures and Feature Sets

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/5gjerdvgynop

Abstract

In this paper, we describe a new database with audio recordings of non-native (L2) speakers of English, and the perceptual evaluation experiment conducted with native English speakers for assessing the prosody of each recording. These annotations are then used to compute the gold standard using different methods, and a series of regression experiments is conducted to evaluate their impact on the performance of a regression model predicting the degree of naturalness of L2 speech. Further, we compare the relevance of different feature groups modelling prosody in general (without speech tempo), speech rate and pauses modelling speech tempo (fluency), voice quality, and a variety of spectral features. We also discuss the impact of various fusion strategies on performance.Overall, our results demonstrate that the prosody of non-native speakers of English as L2 can be reliably assessed using supra-segmental audio features; prosodic features seem to be the most important ones.

Details

Paper ID
lrec2016-main-211
Pages
pp. 1328-1332
BibKey
coutinho-etal-2016-assessing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • EC

    Eduardo Coutinho

  • FH

    Florian Hönig

  • YZ

    Yue Zhang

  • SH

    Simone Hantke

  • AB

    Anton Batliner

  • EN

    Elmar Nöth

  • BS

    Björn Schuller

Links