Back to Main Conference 2004
LREC 2004main

The COST 278 MASPER Initiative - Crosslingual Speech Recognition with Large Telephone Databases

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/4u3f65hffhha

Abstract

This paper presents the work on crosslingual speech recognition carried out by the MASPER initiative that was formed as a part of the COST 278 Action. Two different approaches for transfering monolingual source acoustic models to a new language were compared. The first one was expert-driven, based on the IPA scheme. The second was data-driven, based on a crosslingual phoneme confusion matrix. German, Spanish, Hungarian and Slovak were used as sourcelanguages. Slovenian was selected to be the target language. All experiments were carried out on SpeechDat databases. The results' analysis showed that the expert-driven method outperforms the data-driven one, and that similarities between source and target language have a significant influence on the performance.

Details

Paper ID
lrec2004-main-090
Pages
N/A
BibKey
zgank-etal-2004-cost
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • Andrej Žgank

  • ZK

    Zdravko Kačič

  • FD

    Frank Diehl

  • KV

    Klara Vicsi

  • GS

    Gyorgy Szaszak

  • JJ

    Jozef Juhar

  • SL

    Slavomir Lihan

Links