Back to Main Conference 2012
LREC 2012main

Identifying bilingual Multi-Word Expressions for Statistical Machine Translation

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/3jrrxjnqy96m

Abstract

MultiWord Expressions (MWEs) repesent a key issue for numerous applications in Natural Language Processing (NLP) especially for Machine Translation (MT). In this paper, we describe a strategy for detecting translation pairs of MWEs in a French-English parallel corpus. In addition we introduce three methods aiming to integrate extracted bilingual MWE S in M OSES, a phrase based Statistical Machine Translation (SMT) system. We experimentally show that these textual units can improve translation quality.

Details

Paper ID
lrec2012-main-527
Pages
pp. 674-679
BibKey
bouamor-etal-2012-identifying
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • DB

    Dhouha Bouamor

  • NS

    Nasredine Semmar

  • PZ

    Pierre Zweigenbaum

Links