Bikers Accessing the Web: The SmartWeb Motorbike Corpus

Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)

DOI:10.63317/5je5tm7wjmvr

Abstract

Three advanced German speech corpora have been collected during the German SmartWeb project. One of them, the SmartWeb Motorbike Corpus (SMC), is described in this paper. As with all SmartWeb speech corpora, SMC is designed for a dialogue system dealing with open domains. The corpus is recorded under the special circumstances of a motorbike ride and contains utterances of the driver related to information retrieval from various sources and on different topics. Audio tracks show characteristic noise from the engine and surrounding traffic as well as dropouts caused by the transmission over Bluetooth and the UMTS mobile network. We discuss the problems of the technical setup and the fully automatic evocation of naturally spoken queries by means of dialogue-like sequences.

Details

Paper ID
lrec2006-main-154
Pages
N/A
BibKey
kaiser-etal-2006-bikers
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-2-4
Conference
Fifth International Conference on Language Resources and Evaluation
Location
Genoa, Italy
Date
24 May 2006 to 26 May 2006

Authors

  • Moritz Kaiser

  • Hannes Mögele

  • Florian Schiel

Links