Back to Main Conference 2010
LREC 2010main

A Dataset for Assessing Machine Translation Evaluation Metrics

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/2iotwi4xhg3p

Abstract

We describe a dataset containing 16,000 translations produced by four machine translation systems and manually annotated for quality by professional translators. This dataset can be used in a range of tasks assessing machine translation evaluation metrics, from basic correlation analysis to training and test of machine learning-based metrics. By providing a standard dataset for such tasks, we hope to encourage the development of better MT evaluation metrics.

Details

Paper ID
lrec2010-main-349
Pages
N/A
BibKey
specia-etal-2010-dataset
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • LS

    Lucia Specia

  • NC

    Nicola Cancedda

  • MD

    Marc Dymetman

Links