Back to Main Conference 2002
LREC 2002main

Preliminary Evaluation of Slovenian Mobile Database PoliDat

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/2epsufmjt93y

Abstract

The following paper describes the preliminary speech recognition evaluation of PoliDat database. This new database contains Slovenian speech captured over mobile telephones.  The design of database is modeled according to the SpeechDat(II) specifications. The recording of speech material and the format of the database are shortly described. The speech recognition experiment is based on slightly modified COST 249 refrec0.96 script. Acoustic HMM speech models are trained on the fixed telephone Slovenian 1000 FDB SpeechDat(II) database. 40 speakers were taken from mobile PoliDat database, 20 for test set and 20 for adaptation set. First the signal to noise ratio of all recordings was calculated, then the speech recognition with unadapted acoustic models was performed. In the next step the retraining of acoustic models and maximum likelihood linear regression procedure were used for adaptation. In the last step, the adapted acoustic models were used for speech recognition with the PoliDat database. The adaptation procedures significantly improved the mobile speech recognition with fixed acoustic models. The overall word error rate decreased from 46.5% for unadapted models to 19.1% and 5.2% for adapted models.

Details

Paper ID
lrec2002-main-089
Pages
N/A
BibKey
zgank-etal-2002-preliminary
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • Andrej Žgank

  • ZK

    Zdravko Kačič

  • BH

    Bogomir Horvat

Links