Back to Main Conference 2004
LREC 2004main

Generic Text Summarization Using WordNet

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/5dnot53ivcvt

Abstract

This paper presents a WordNet based approach to text summarization. The document to be summarized is used to extract a “relevant” sub-graph from the WordNet graph. Weights are assigned to each node of this sub-graph using a strategy similar to the Google Pageranking algorithm. These weights capture the relevance of the respective synsets with respect to the whole document. A matrix in which each row repesents a sentence and each column a node of the sub-graph (i.e., a synset) is created. Principal Component Analysis is performed on this matrix to help extract the sentences for the summary. Our approach is generic unlike most previous approaches which address specific genres of documents like news articles and biographies. Testing our system on the standard DUC2002 extracts shows that our results are promising and comparable to existing summarizers.

Details

Paper ID
lrec2004-main-187
Pages
N/A
BibKey
bellare-etal-2004-generic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • KB

    Kedar Bellare

  • AS

    Anish Das Sarma

  • AS

    Atish Das Sarma

  • NL

    Navneet Loiwal

  • VM

    Vaibhav Mehta

  • GR

    Ganesh Ramakrishnan

  • PB

    Pushpak Bhattacharyya

Links