Back to Main Conference 2002
LREC 2002main

Designing Prosodic Databases for Automatic Modeling of Slovenian Language in a Multilingual TTS System

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/35huqupwrrbk

Abstract

In this paper the design of a prosodic data base and the data driven prediction of phrase breaks for modeling Slovenian language in a multilingual text-to-speech (TTS) system are presented. Automatic learning techniques offer a solution in adapting prosodic models to a new language, voice or a new application, because they allow prosodic regularities to be automatically extracted from a prosodic database of natural speech. Such techniques depend on the construction of a large corpus labeled with symbolic prosody labels. The labeling can be done either automatically or by hand. While automatic labeling can be less accurate than hand labeling, the latter is very time consuming. Therefore an interactive tool for semi-automatic labeling that uses the segmented spoken counterpart of the text as input will be presented. The tool combines the advantage of hand labeling and automatic labeling by achieving a high consistency in labeling and reducing the time that would be needed for hand labeling. The labeled Slovenian corpus has been used to train our phrase break prediction module. Experiments for the data driven prediction of major and minor phrase break labels have been performed. The achieved prediction accuracy marks state-of-the art for phrase break prediction accuracy for Slovenian language.

Details

Paper ID
lrec2002-main-033
Pages
N/A
BibKey
muller-etal-2002-designing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • AM

    Achim F. Müller

  • JS

    Janez Stergar

  • BH

    Bogomir Horvat

Links