Back to Main Conference 2016
LREC 2016main

A Bilingual Discourse Corpus and Its Applications

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/229wetw637gx

Abstract

Existing discourse research only focuses on the monolingual languages and the inconsistency between languages limits the power of the discourse theory in multilingual applications such as machine translation. To address this issue, we design and build a bilingual discource corpus in which we are currently defining and annotating the bilingual elementary discourse units (BEDUs). The BEDUs are then organized into hierarchical structures. Using this discourse style, we have annotated nearly 20K LDC sentences. Finally, we design a bilingual discourse based method for machine translation evaluation and show the effectiveness of our bilingual discourse annotations.

Details

Paper ID
lrec2016-main-159
Pages
pp. 1002-1007
BibKey
liu-etal-2016-bilingual
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • YL

    Yang Liu

  • JZ

    Jiajun Zhang

  • CZ

    Chengqing Zong

  • YY

    Yating Yang

  • XZ

    Xi Zhou

Links