Comparing Corpus-based to Web-based Lookup Techniques for Automatic English Inclusion Detection
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Abstract
The influence of English as a global language continues to grow, to such an extent that its words and expressions permeate the original forms of other languages. This paper evaluates a modular Web-based sub-component of an existing English inclusion classifier and compares it to a corpus-based lookup technique. Both approaches are evaluated on a German gold standard data set. The evaluation demonstrates the extent to which the Web-based approach benefits from the amount of data available online and from the fact that this data is constantly updated.