Back to Main Conference 2004
LREC 2004main

Collecting and Sharing Bilingual Spontaneous Speech Corpora: the ChinFaDial Experiment

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/4fnbjsnjy7pz

Abstract

We describe here the three main platforms in the ERIM family of Web-based environments for human interpreting, two of them in more details – ERIM-Interp and ERIM-Collect –, then ERIM-Aid. Each platform supports an aspect of the collecting or study of spontaneous bilingual dialogues, translated by an interpreter. ERIM-Interp is the core environment, providing mediated communication between speakers and human interpreters over the network. Using ERIM-Collect, French-Chinese interpreting data have been collected within the three-year "ChinFaDial" project supported by LIAMA, a French-Chinese laboratory in Beijing. These "raw" speech data will be made available in the spring of 2004 on an open-access basis, using the DistribDial server, on a CLIPS-GETA website. Our goal is to extend such corpora, on a collaborative scheme, to allow other research groups to contribute to the site whatever annotations they may have created, and to share them under the same conditions (GPL). An ERIM-Aid variant is intended to provide focused machine aids to human interpreters working over the Web, or possibly to distant monolingual speakers conversing in different languges.

Details

Paper ID
lrec2004-main-461
Pages
N/A
BibKey
fafiotte-etal-2004-collecting
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • GF

    Georges Fafiotte

  • CB

    Christian Boitet

  • MS

    Mark Seligman

  • CZ

    Chengqing Zong

Links