Back to Main Conference 2010
LREC 2010main

The More the Better? Assessing the Influence of Wikipedia’s Growth on Semantic Relatedness Measures

Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)

DOI:10.63317/3mnkxbvaato7

Abstract

Wikipedia has been used as a knowledge source in many areas of natural language processing. As most studies only use a certain Wikipedia snapshot, the influence of Wikipedia’s massive growth on the results is largely unknown. For the first time, we perform an in-depth analysis of this influence using semantic relatedness as an example application that tests a wide range of Wikipedia’s properties. We find that the growth of Wikipedia has almost no effect on the correlation of semantic relatedness measures with human judgments, while the coverage steadily increases.

Details

Paper ID
lrec2010-main-055
Pages
N/A
BibKey
zesch-gurevych-2010-better
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-6-7
Conference
Seventh International Conference on Language Resources and Evaluation
Location
Valletta, Malta
Date
17 May 2010 23 May 2010

Authors

  • TZ

    Torsten Zesch

  • IG

    Iryna Gurevych

Links