LREC 2014main
Pivot-based multilingual dictionary building using Wiktionary
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)
Abstract
We describe a method for expanding existing dictionaries in several languages by discovering previously non-existent links between translations. We call this method triangulation and we present and compare several variations of it. We assess precision manually, and recall by comparing the extracted dictionaries with independently obtained basic vocabulary sets. We featurize the translation candidates and train a maximum entropy classifier to identify correct translations in the noisy data.
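The abstract does not spell out the triangulation procedure, so the following is only a minimal sketch of the general pivot idea, under the assumption that two words in different languages become a translation candidate when both are already linked to the same third word (the pivot). The function name `triangulate`, the `(language, word)` tuple representation, and the toy word pairs are hypothetical and are not taken from the paper. The number of distinct pivots supporting a candidate is kept because it could plausibly serve as one feature for a classifier such as the one the abstract mentions.

```python
from collections import defaultdict
from itertools import combinations


def triangulate(known_pairs):
    """Propose new translation pairs through shared pivot words.

    known_pairs: iterable of ((lang, word), (lang, word)) translation links.
    Returns a dict mapping each candidate pair to the set of pivots behind it.
    """
    # Treat translation links as undirected edges between (lang, word) nodes.
    neighbors = defaultdict(set)
    for a, b in known_pairs:
        neighbors[a].add(b)
        neighbors[b].add(a)

    candidates = defaultdict(set)
    for pivot, linked in neighbors.items():
        # Every two words linked to the same pivot form a potential pair.
        for a, c in combinations(sorted(linked), 2):
            if a[0] == c[0]:
                continue  # same language, so not a translation pair
            if c in neighbors[a]:
                continue  # link already present in the input
            candidates[(a, c)].add(pivot)
    return dict(candidates)


# Toy input (illustrative only): no direct en-es or de-fr link is given.
known = [
    (("en", "dog"), ("de", "Hund")),
    (("en", "dog"), ("fr", "chien")),
    (("es", "perro"), ("de", "Hund")),
    (("es", "perro"), ("fr", "chien")),
]
candidates = triangulate(known)
for pair, pivots in sorted(candidates.items()):
    print(pair, "supported by", len(pivots), "pivot(s)")
```

In this toy input each proposed pair is reachable through two different pivots. A candidate supported by several pivots is intuitively less likely to be an artifact of a polysemous pivot word than one supported by a single pivot, which is one reason a pivot count might be worth keeping as a feature.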