Back to Main Conference 2002
LREC 2002main

Automatic extraction of differences between spoken and written languages, and automatic translation from the written to the spoken language

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/4c4okq3oz29i

Abstract

We extracted the differences between spoken language and written language from a spoken-language corpus and a written-language corpus by using the UNIX command ``diff'' and examined the differences to determine the construction of the grammars of the two corpora. We also transformed written-language sentences into spoken-language sentences by using rules based on the extracted differences.

Details

Paper ID
lrec2002-main-027
Pages
N/A
BibKey
murata-isahara-2002-automatic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • MM

    Masaki Murata

  • HI

    Hitoshi Isahara

Links