Back to Main Conference 2010
LREC 2010main
The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)
Abstract
Wikipedia has been used as a knowledge source in many areas of natural language processing. As most studies only use a certain Wikipedia snapshot, the influence of Wikipedias massive growth on the results is largely unknown. For the first time, we perform an in-depth analysis of this influence using semantic relatedness as an example application that tests a wide range of Wikipedias properties. We find that the growth of Wikipedia has almost no effect on the correlation of semantic relatedness measures with human judgments, while the coverage steadily increases.