Back to Main Conference 2002
LREC 2002main

Proposal of a very-large-corpus acquisition method by cell-formed registration

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/39wds9erdb8m

Abstract

One promising way to improve the performance of a speech translation system is to collect a large volume of data in the target tasks/domains. However, a naïve expansion of the traditional data collection scheme consumes valuable resources. Advanced speech recognition technology can provide a highly accurate recognizer if a machine-friendly speech is permitted. We propose a new data collection scheme that is supported by this speaking style. The preliminary results of data collection show that the proposed scheme has a three-digit efficiency.

Details

Paper ID
lrec2002-main-309
Pages
N/A
BibKey
suyaga-etal-2002-proposal
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • FS

    Fumiaki Suyaga

  • TT

    Toshiyuki Takezawa

  • GK

    Genichiro Kikui

  • SY

    Seiichi Yamamoto

Links