The MoveOn Motorcycle Speech Corpus

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/4a4ddrq6y6ky

Abstract

A speech and noise corpus addressing the extreme conditions of the motorcycle environment has been developed within the MoveOn project. Speech utterances in British English are recorded and processed with a view to command-and-control and template-driven dialog systems for use on a motorcycle. Most of the corpus consists of noisy speech and environmental noise recorded on a motorcycle, but several clean speech recordings made in a silent environment are also available. Corpus development focuses on distortion-free recordings and accurate descriptions of both the recorded speech and the noise: environmental noise is annotated as well as the speech segments. The corpus is small, with about 12 hours of clean and noisy speech utterances and about 30 hours of environmental noise segments without speech. This paper describes the motivation for and the development of the speech corpus, and presents some statistics and results of the database creation.

Details

Paper ID
lrec2008-main-508
Pages
N/A
BibKey
winkler-etal-2008-moveon
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28–30 May 2008

Authors

  • Thomas Winkler

  • Theodoros Kostoulas

  • Richard Adderley

  • Christian Bonkowski

  • Todor Ganchev

  • Joachim Köhler

  • Nikos Fakotakis
