The Fisher Corpus: a Resource for the Next Generations of Speech-to-Text

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/5q862povzthr

Abstract

This paper describes, within the context of the DARPA EARS program, the design and implementation of the Fisher protocol for collecting conversational telephone speech, which has yielded more than 16,000 English conversations. It also discusses the Quick Transcription specification that allowed 2,000 hours of Fisher audio to be transcribed in less than one year. Fisher data is already in use within the DARPA EARS program and will be published via the Linguistic Data Consortium for general use beginning in 2004.

Details

Paper ID
lrec2004-main-500
Pages
N/A
BibKey
cieri-etal-2004-fisher
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26-28 May 2004

Authors

  • Christopher Cieri
  • David Miller
  • Kevin Walker
