Learning to Predict Pitch Accents Using Bayesian Belief Networks for Greek Language

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

Abstract

Any text-to-speech (TTS) system that aims at producing understandable and natural-sounding output needs to have a module for predicting prosody. In natural speech, some words are said to be stressed, or to bear Pitch Accents (PA). Errors at this level may impede the listener in the correct understanding of the spoken utterance. Regarding the performance of data driven methods, the scale and quality of the corpus are important. Since there is no suitable corpus available for modeling Modern Greek (MG) prosody, we created a corpus consisted of 5.500 words, distributed in 500 paragraphs. In describing pitch accent in particular and intonation features in general, we use Pierrehumbert's theory adopted for MG. In the present study, we try to predict four categories of PA H*, L*, L+H* and unaccented.