
SParseval: Evaluation Metrics for Parsing Speech

Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)

DOI: 10.63317/2d6a8bqw5fqi

Abstract

While both spoken and written language processing stand to benefit from parsing, the standard Parseval metrics (Black et al., 1991) and their canonical implementation (Sekine and Collins, 1997) are only useful for text. The Parseval metrics are undefined when the words input to the parser do not match the words in the gold standard parse tree exactly, and word errors are unavoidable with automatic speech recognition (ASR) systems. To fill this gap, we have developed a publicly available tool for scoring parses that implements a variety of metrics which can handle mismatches in words and segmentations, including: alignment-based bracket evaluation, alignment-based dependency evaluation, and a dependency evaluation that does not require alignment. We describe the different metrics, how to use the tool, and the outcome of an extensive set of experiments on the sensitivity of these metrics.
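To make the alignment-based bracket evaluation named in the abstract concrete, the sketch below shows one way such a metric can be computed: hypothesis words are aligned to reference words, hypothesis bracket spans are projected through that alignment, and Parseval-style precision, recall, and F1 are computed over the projected spans. This is an illustrative sketch, not the SParseval tool itself; the function names and the use of difflib.SequenceMatcher for word alignment are assumptions, and SParseval's actual alignment and scoring details may differ.

```python
# Illustrative sketch of alignment-based bracket evaluation (not SParseval).
# Assumption: words are aligned with difflib.SequenceMatcher; the real
# tool's alignment procedure may differ.
from collections import Counter
from difflib import SequenceMatcher

def align(ref_words, hyp_words):
    """Map each matched hypothesis word index to its reference word index."""
    mapping = {}
    sm = SequenceMatcher(a=ref_words, b=hyp_words, autojunk=False)
    for ref_i, hyp_i, size in sm.get_matching_blocks():
        for k in range(size):
            mapping[hyp_i + k] = ref_i + k
    return mapping

def map_span(span, mapping):
    """Project a (label, start, end) hypothesis span onto reference indices.

    Returns None if the span covers any unaligned (ASR-error) word.
    """
    label, start, end = span                      # end is exclusive
    idxs = [mapping[i] for i in range(start, end) if i in mapping]
    if len(idxs) != end - start:
        return None
    return (label, min(idxs), max(idxs) + 1)

def bracket_prf(ref_spans, hyp_spans, mapping):
    """Parseval-style precision/recall/F1 over alignment-mapped spans."""
    mapped = [map_span(h, mapping) for h in hyp_spans]
    # Multiset match: a hypothesis bracket scores only if it projects onto
    # a reference bracket with the same label and word extent.
    matched = sum((Counter(s for s in mapped if s) & Counter(ref_spans)).values())
    p = matched / len(hyp_spans) if hyp_spans else 0.0
    r = matched / len(ref_spans) if ref_spans else 0.0
    f = 2 * p * r / (p + r) if p + r else 0.0
    return p, r, f

# Toy usage: one ASR substitution ("the" -> "a") leaves the brackets that
# contain the errorful word uncreditable, while "VP" still matches.
ref_words = "the dog barked".split()
hyp_words = "a dog barked".split()
spans = [("S", 0, 3), ("NP", 0, 2), ("VP", 2, 3)]
print(bracket_prf(spans, spans, align(ref_words, hyp_words)))
```

In this toy run, only the VP bracket survives projection, giving precision, recall, and F1 of 1/3 each, which illustrates how brackets spanning an ASR error are penalized even when the parse structure itself is correct.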

Details

Paper ID
lrec2006-main-060
Pages
N/A
BibKey
roark-etal-2006-sparseval
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-2-4
Conference
Fifth International Conference on Language Resources and Evaluation
Location
Genoa, Italy
Date
24–26 May 2006

Authors

  • Brian Roark
  • Mary Harper
  • Eugene Charniak
  • Bonnie Dorr
  • Mark Johnson
  • Jeremy Kahn
  • Yang Liu
  • Mari Ostendorf
  • John Hale
  • Anna Krasnyanskaya
  • Matthew Lease
  • Izhak Shafran
  • Matthew Snover
  • Robin Stewart
  • Lisa Yung
