Back to Main Conference 2010
LREC 2010main

Achieving Domain Specificity in SMT without Overt Siloing

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/3jiwz2e4uao2

Abstract

We examine pooling data as a method for improving Statistical Machine Translation (SMT) quality for narrowly defined domains, such as data for a particular company or public entity. By pooling all available data, building large SMT engines, and using domain-specific target language models, we see boosts in quality, and can achieve the generalizability and resiliency of a larger SMT but with the precision of a domain-specific engine.

Details

Paper ID
lrec2010-main-545
Pages
N/A
BibKey
lewis-etal-2010-achieving
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • WL

    William D. Lewis

  • CW

    Chris Wendt

  • DB

    David Bullock

Links