Title

Multi-Document Summarization using Multiple-Sequence Alignment

Author(s)

V. Finley Lacatusu, Steven J. Maiorano, Sanda M. Harabagiu

Human Language Technology Research Institute, University of Texas at Dallas

Session

O14-W

Abstract

This paper describes a novel clustering-based text summarization system that uses Multiple Sequence Alignment to improve the alignment of sentences within topic clusters. While most current clustering-based summarization systems base their summaries only on the common information contained in a collection of highly-related sentences, our system constructs more informative summaries that incorporate both the redundant and unique contributions of the sentences in the cluster. When evaluated using ROUGE, the summaries produced by our system represent a substantial improvement over the baseline, which is at 63% of the human performance.

Keyword(s)

Summarization

Language(s)

English

Full Paper

408.pdf