Back to Main Conference 2002
LREC 2002main

Acoustic Modeling and Training of a Bilingual ASR System when a Minority Language is Involved

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/47cqoviag6ii

Abstract

This paper describes our work in developing a bilingual speech recognition system using two SpeechDat databases. The bilingual aspect of this work is of particular importance in the Galician region of Spain where both languages Galician and Spanish coexist and one of the languages, the Galician one, is a minority language. Based on a global  Spanish-Galician phoneme set we built a bilingual speech recognition system which can handle both languages: Spanish and Galician. The recognizer makes use of context dependent acoustic models based on continuous density hidden Markov models. The system has been evaluated on a isolated-word large-vocabulary task. The tests show that Spanish system exhibits a better performance than the Galician system due to its better training. The bilingual system provides an equivalent performance to that achieved by the language specific systems.

Details

Paper ID
lrec2002-main-016
Pages
N/A
BibKey
docio-fernandez-garcia-mateo-2002-acoustic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • LD

    Laura Docío-Fernández

  • CG

    Carmen García-Mateo

Links