Back to Main Conference 2002
LREC 2002main

n-grams of Seeds: A Hybrid System for Corpus-Based Text Summarization

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/2mg7qeehcnp8

Abstract

This paper presents a hybrid system for automatic text summarization which combines statistical and knowledge-based methods. In particular, it demonstrates how two corpus-based learning and indexing algorithms, namely an n-gram and a seed-oriented approach, may be combined to bring out the best of both approaches. This system selects sentences from an input text to constract a highly compressed, generic, and informative summary. The hybrid algorithm described here was developed and tested with a corpus of movie reviews collected from several on-line data bases.

Details

Paper ID
lrec2002-main-021
Pages
N/A
BibKey
schneider-2002-n
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • RS

    René Schneider

Links