Statistical Analysis for Thesaurus Construction using an Encyclopedic Corpus
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
This paper proposes a discrimination method for hierarchical relations between word pairs. The method is statistical and uses an “encyclopedic corpus” extracted and organized from Web pages. It exploits the statistical tendency for hyponyms' descriptions to include hypernyms, whereas hypernyms' descriptions do not include all of the hyponyms. Experimental results show that the method detected 61.7% of the relations in an actual thesaurus.
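The asymmetry described in the abstract can be illustrated with a minimal sketch. This is not the paper's actual statistic, which the abstract does not specify: it is a toy rule that guesses the hypernym of a word pair from which description mentions the other word. The corpus `TOY_CORPUS`, the helper `mentions`, and the function `guess_hypernym` are all hypothetical names introduced for illustration only.

```python
# Toy "encyclopedic corpus": headword -> description text.
# These entries are invented for illustration, not taken from the paper.
TOY_CORPUS = {
    "sparrow": "a small brown bird that lives near houses",
    "penguin": "a flightless bird of the southern hemisphere",
    "bird": "a warm-blooded animal with feathers and wings",
}


def mentions(descriptions, headword, term):
    """Return True if `term` occurs as a token in the description of `headword`."""
    return term in descriptions.get(headword, "").lower().split()


def guess_hypernym(descriptions, a, b):
    """Guess which word of the pair (a, b) is the hypernym.

    Assumed rule (a simplification): if a's description mentions b
    but b's description does not mention a, treat b as the hypernym.
    Returns None when the evidence is symmetric or absent.
    """
    a_mentions_b = mentions(descriptions, a, b)
    b_mentions_a = mentions(descriptions, b, a)
    if a_mentions_b and not b_mentions_a:
        return b
    if b_mentions_a and not a_mentions_b:
        return a
    return None


if __name__ == "__main__":
    print(guess_hypernym(TOY_CORPUS, "sparrow", "bird"))
    print(guess_hypernym(TOY_CORPUS, "sparrow", "penguin"))
```

A real system would presumably aggregate such evidence over many descriptions and apply a statistical threshold rather than a single yes/no check. The sketch only shows why the direction of mention carries information about the direction of the hierarchical relation.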