Back to Main Conference 2004
LREC 2004main

SALA II Across the Finish Line: A Large Collection of Mobile Telephone Speech Databases from North and Latin America completed

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/5dvp6ckmy2kh

Abstract

The SALA II project comprises mobile telephone recordings according to the SpeechDat (II) paradigm for several languages in North and Latin America. Each database contains the recordings of 1000 speakers, with the exception of US Spanish (2000 speakers) and US English (4000 speakers). A quarter of the recordings of each database are made respectively in a quiet environment (home/office), in the street, in a public place, and in a moving vehicle. This paper presents an evaluation of the project. The paper details on experiences with respect to the implementation of design specifications, speaker recruitment, data recordings (on site), data processing, orthographic transcription and lexicon generation. Furthermore, the validation procedure and its results are documented. Finally, the availability and distribution of the databases are addressed.

Details

Paper ID
lrec2004-main-153
Pages
N/A
BibKey
van-den-heuvel-etal-2004-sala
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • Hv

    Henk van den Heuvel

  • PH

    Phil Hall

  • HH

    Harald Höge

  • AM

    Asunción Moreno

  • AR

    Antonio Rincon

  • FS

    Francesco Senia

Links