Back to Main Conference 2012
LREC 2012main

Designing French Tale Corpora for Entertaining Text To Speech Synthesis

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/5j2xs6irsmmi

Abstract

Text and speech corpora for training a tale telling robot have been designed, recorded and annotated. The aim of these corpora is to study expressive storytelling behaviour, and to help in designing expressive prosodic and co-verbal variations for the artificial storyteller). A set of 89 children tales in French serves as a basis for this work. The tales annotation principles and scheme are described, together with the corpus description in terms of coverage and inter-annotator agreement. Automatic analysis of a new tale with the help of this corpus and machine learning is discussed. Metrics for evaluation of automatic annotation methods are discussed. A speech corpus of about 1 hour, with 12 tales has been recorded and aligned and annotated. This corpus is used for predicting expressive prosody in children tales, above the level of the sentence.

Details

Paper ID
lrec2012-main-520
Pages
pp. 1003-1010
BibKey
doukhan-etal-2012-designing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • DD

    David Doukhan

  • SR

    Sophie Rosset

  • AR

    Albert Rilliard

  • Cd

    Christophe d’Alessandro

  • MA

    Martine Adda-Decker

Links