LREC 2008 Main Conference

Evaluating Summaries Automatically - A system Proposal

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/4agoavpjst6y

Abstract

We propose in this paper an automatic evaluation procedure based on a metric which could provide summary evaluation without human assistance. Our system includes two metrics, which are presented and discussed. The first metric is based on a known and powerful statistical test, the χ² goodness-of-fit test, which has been used in several applications. The second metric is derived from three common metrics used to evaluate Natural Language Processing (NLP) systems, namely precision, recall and F-measure. The combination of these two metrics is intended to allow one to assess the quality of summaries quickly, cheaply and without the need for human intervention, thereby minimizing the role of subjective judgment and bias.
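As a rough illustration of the two kinds of metric named in the abstract, the sketch below computes a Pearson chi-square goodness-of-fit statistic and precision, recall and F-measure over word tokens. It is not the authors' system: the abstract does not say how the metrics are defined over summaries, so the whitespace tokenization, the use of the source text's word distribution as the expected distribution, the set-based overlap, and all function names here are assumptions made for the example.

```python
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization, chosen only for brevity.
    return text.lower().split()


def chi_square_statistic(source_tokens, summary_tokens):
    """Pearson chi-square statistic: sum of (observed - expected)^2 / expected.

    Observed counts come from the summary; expected counts scale the source
    word distribution to the summary length (an assumption of this sketch).
    Words that occur only in the summary are ignored.
    """
    if not source_tokens or not summary_tokens:
        return 0.0
    source_counts = Counter(source_tokens)
    summary_counts = Counter(summary_tokens)
    n_source = len(source_tokens)
    n_summary = len(summary_tokens)
    statistic = 0.0
    for word, count in source_counts.items():
        expected = n_summary * count / n_source
        observed = summary_counts.get(word, 0)
        statistic += (observed - expected) ** 2 / expected
    return statistic


def precision_recall_f(candidate_tokens, reference_tokens):
    """Set-based precision, recall and balanced F-measure over word types."""
    candidate = set(candidate_tokens)
    reference = set(reference_tokens)
    overlap = len(candidate & reference)
    precision = overlap / len(candidate) if candidate else 0.0
    recall = overlap / len(reference) if reference else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f_measure = 2 * precision * recall / (precision + recall)
    return precision, recall, f_measure


if __name__ == "__main__":
    source = tokenize("The cat sat on the mat")
    summary = tokenize("The cat")
    print(chi_square_statistic(source, summary))
    print(precision_recall_f(summary, source))
```

Under these assumptions a lower chi-square value means the summary's word distribution is closer to that of the source, while precision and recall measure how much of the vocabulary the two texts share. How the paper actually combines the two metrics is not stated in the abstract.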

Details

Paper ID
lrec2008-main-063
Pages
N/A
BibKey
de-oliveira-etal-2008-evaluating
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 - 30 May 2008

Authors

  • Paulo C F de Oliveira

  • Edson Wilson Torrens

  • Alexandre Cidral

  • Sidney Schossland

  • Evandro Bittencourt
