LREC 2004 (main conference)

Duration Modeling For Turkish Text-to-Speech Synthesis System

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/4p9taobj665f

Abstract

The naturalness of synthetic speech depends on appropriate modeling of its prosodic aspects. Three prosodic components are typically modeled: segmental duration, pitch contour, and intensity. In this study, we present our work on modeling segmental duration in Turkish using machine-learning algorithms. The models predict phone durations from attributes such as phone identity, neighboring phone identities, lexical stress, position of the syllable in the word, part-of-speech information, word length in syllables, and position of the word in the utterance. The resulting models predict segment durations more accurately than mean-duration approximations.
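The comparison described in the abstract can be illustrated with a minimal sketch. The abstract does not name the machine-learning algorithms used, so a simple lookup of mean durations conditioned on phone identity, stress, and syllable position stands in for them here, with a per-phone mean as the mean-duration approximation. All data values, function names, and the choice of three attributes are hypothetical, and the error is measured on the same toy samples used for fitting, so it only shows the mechanics of the comparison.

```python
from collections import defaultdict
from math import sqrt

# Hypothetical toy data: (phone, stressed, syllable_position, duration_ms).
# These values are invented for illustration and are not from the paper.
SAMPLES = [
    ("a", True, "final", 110.0),
    ("a", True, "final", 118.0),
    ("a", False, "initial", 70.0),
    ("a", False, "initial", 76.0),
    ("k", True, "final", 62.0),
    ("k", False, "initial", 48.0),
    ("k", False, "initial", 52.0),
]


def mean(values):
    return sum(values) / len(values)


def fit_phone_means(samples):
    """Baseline: one mean duration per phone identity."""
    by_phone = defaultdict(list)
    for phone, _stress, _pos, dur in samples:
        by_phone[phone].append(dur)
    return {phone: mean(durs) for phone, durs in by_phone.items()}


def fit_context_means(samples):
    """Stand-in model: mean duration per (phone, stress, position) context."""
    by_ctx = defaultdict(list)
    for phone, stress, pos, dur in samples:
        by_ctx[(phone, stress, pos)].append(dur)
    return {ctx: mean(durs) for ctx, durs in by_ctx.items()}


def predict(phone, stress, pos, context_means, phone_means):
    """Use the context mean if that context was seen, else back off to the phone mean."""
    return context_means.get((phone, stress, pos), phone_means[phone])


def rmse(pairs):
    """Root-mean-square error over (predicted, observed) pairs."""
    return sqrt(mean([(pred - obs) ** 2 for pred, obs in pairs]))


phone_means = fit_phone_means(SAMPLES)
context_means = fit_context_means(SAMPLES)

baseline_rmse = rmse([(phone_means[p], d) for p, _s, _pos, d in SAMPLES])
model_rmse = rmse(
    [(predict(p, s, pos, context_means, phone_means), d) for p, s, pos, d in SAMPLES]
)

print(f"per-phone mean baseline RMSE: {baseline_rmse:.2f} ms")
print(f"context-conditioned RMSE:     {model_rmse:.2f} ms")
```

A realistic evaluation would hold out test utterances and use the full attribute set listed in the abstract; the backoff in `predict` is one simple way to handle contexts that never occur in the training data.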

Details

Paper ID
lrec2004-main-048
Pages
N/A
BibKey
ozturk-etal-2004-duration
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26–28 May 2004

Authors

  • Özlem Öztürk

  • Özgul Salor

  • Tolga Çiloğlu

  • Mubeccel Demirekler
