Design of Optimal Slovenian Speech Corpus for Use in the Concatenative Speech Synthesis System
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)
Abstract
This paper presents the development of a Slovenian speech corpus for use in the concatenative speech synthesis system being developed at the University of Maribor, Slovenia. The emphasis is on maximising the usefulness of the defined speech corpus for concatenation purposes. The usefulness of a speech corpus depends heavily on the corresponding text and can be increased by choosing an appropriate text. In our approach, detailed statistics of the text corpora were computed in order to define sentences rich in non-uniform units such as monophones, diphones and triphones.
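As an illustration only, the idea of choosing sentences that are rich in such units can be sketched as unit counting followed by a greedy coverage selection. The abstract does not state which selection procedure was used, so the greedy strategy, the function names (`units`, `unit_counts`, `greedy_select`) and the toy phone sequences below are all assumptions made for this sketch, not the method of the paper.

```python
from collections import Counter


def units(phones, n):
    """Return the n-phone units of one sentence (n=1 monophones, 2 diphones, 3 triphones)."""
    return [tuple(phones[i:i + n]) for i in range(len(phones) - n + 1)]


def unit_counts(sentences, n):
    """Count how often each n-phone unit occurs in the whole text corpus."""
    counts = Counter()
    for phones in sentences:
        counts.update(units(phones, n))
    return counts


def greedy_select(sentences, n, max_sentences):
    """Hypothetical greedy selection: repeatedly take the sentence that
    adds the most n-phone units not yet covered by earlier choices."""
    covered = set()
    chosen = []
    remaining = list(range(len(sentences)))
    while remaining and len(chosen) < max_sentences:
        best = max(remaining, key=lambda i: len(set(units(sentences[i], n)) - covered))
        gain = set(units(sentences[best], n)) - covered
        if not gain:
            break  # no remaining sentence contributes a new unit
        covered |= gain
        chosen.append(best)
        remaining.remove(best)
    return chosen, covered


# Toy input: each sentence is already a list of phone symbols (assumed, not real corpus data).
toy = [["d", "o", "b", "e", "r"], ["d", "a", "n"], ["d", "o", "b"]]
chosen, covered = greedy_select(toy, n=2, max_sentences=3)
print(chosen, len(covered))
```

In this toy case the third sentence is skipped because its diphones are already covered by the first one. A real selection would start from phonetically transcribed text and would probably weight units by their corpus frequency, which this sketch does not attempt.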