
The Slovene BNSI Broadcast News database and reference speech corpus GOS: Towards the uniform guidelines for future work

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/485wzaivsamx

Abstract

The aim of the paper is to search for common guidelines for the future development of speech databases for less-resourced languages, so that they are as useful as possible in both main fields of their use: linguistic research and speech technologies. We compare two standards for creating speech databases: one followed when developing BNSI Broadcast News, the Slovene speech database for automatic speech recognition, and the other followed when developing the Slovene reference speech corpus GOS. We then outline possible common guidelines for future work. We also present an add-on for the GOS corpus that enables its use for automatic speech recognition.

Details

Paper ID
lrec2014-main-558
Pages
pp. 2644-2647
BibKey
zgank-etal-2014-slovene
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26-31 May 2014

Authors

  • Andrej Žgank

  • Ana Zwitter Vitez

  • Darinka Verdonik

Links