Back to Main Conference 2004
LREC 2004main

A Similarity Measure for Unsupervised Semantic Disambiguation

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/429yb25hzfks

Abstract

In this paper we propose a similarity measure aimed to support an unsupervised approach to semantic tagging. This proposal represents a variant of the notion of "Conceptual Density" previously suggested as a tool for sense disambiguation. However, the major difference is the learning framework in which this measure applied to the Wordnet hierarchy enables a "natural" corpus-driven empirical estimation of lexical and contextual probabilities for probabilistic semantic tagging. Experimental results over an hand-annotated portion of the British National Corpus (about 5 M words) are also discussed. Although below the results obtained by a supervised method (Maximum Entropy trained over hand labelled data), the proposed unsupervised tagger confirms the effectiveness of the proposed metric as well as show a promising research direction.

Details

Paper ID
lrec2004-main-471
Pages
N/A
BibKey
basili-etal-2004-similarity
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • RB

    Roberto Basili

  • MC

    Marco Cammisa

  • FZ

    Fabio Massimo Zanzotto

Links