
Parallel Sentence Extraction from Comparable Corpora with Neural Network Features

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/33ujzhai4ek7

Abstract

Parallel corpora are crucial for machine translation (MT); however, they are quite scarce for most language pairs and domains. Because comparable corpora are far more widely available, many studies have sought to extract parallel sentences from them for MT. In this paper, we exploit neural network features acquired from neural MT for parallel sentence extraction. We observe significant improvements in both sentence extraction accuracy and MT performance.

Details

Paper ID
lrec2016-main-468
Pages
pp. 2931-2935
BibKey
chu-etal-2016-parallel
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23-28 May 2016

Authors

  • Chenhui Chu
  • Raj Dabre
  • Sadao Kurohashi
