Back to Main Conference 2014
LREC 2014main

Basque Speecon-like and Basque SpeechDat MDB-600: speech databases for the development of ASR technology for Basque

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/23ufvgsm6txs

Abstract

This paper introduces two databases specifically designed for the development of ASR technology for the Basque language: the Basque Speecon-like database and the Basque SpeechDat MDB-600 database. The former was recorded in an office environment according to the Speecon specifications, whereas the later was recorded through mobile telephones according to the SpeechDat specifications. Both databases were created under an initiative that the Basque Government started in 2005, a program called ADITU, which aimed at developing speech technologies for Basque. The databases belong to the Basque Government. A comprehensive description of both databases is provided in this work, highlighting the differences with regard to their corresponding standard specifications. The paper also presents several initial experimental results for both databases with the purpose of validating their usefulness for the development of speech recognition technology. Several applications already developed with the Basque Speecon-like database are also described. Authors aim to make these databases widely known to the community as well, and foster their use by other groups.

Details

Paper ID
lrec2014-main-583
Pages
pp. 2658-2665
BibKey
odriozola-etal-2014-basque
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 31 May 2014

Authors

  • IO

    Igor Odriozola

  • IH

    Inma Hernaez

  • MT

    María Inés Torres

  • LR

    Luis Javier Rodriguez-Fuentes

  • MP

    Mikel Penagarikano

  • EN

    Eva Navas

Links