Back to Main Conference 2012
LREC 2012main

Analyzing the Impact of Prevalence on the Evaluation of a Manual Annotation Campaign

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/2gnpsbdnmnos

Abstract

This article details work aiming at evaluating the quality of the manual annotation of gene renaming couples in scientific abstracts, which generates sparse annotations. To evaluate these annotations, we compare the results obtained using the commonly advocated inter-annotator agreement coefficients such as S, κ and Ï€, the less known R, the weighted coefficients κω and α as well as the F-measure and the SER. We analyze to which extent they are relevant for our data. We then study the bias introduced by prevalence by changing the way the contingency table is built. We finally propose an original way to synthesize the results by computing distances between categories, based on the produced annotations.

Details

Paper ID
lrec2012-main-310
Pages
pp. 1474-1480
BibKey
fort-etal-2012-analyzing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • KF

    Karën Fort

  • CF

    Claire François

  • OG

    Olivier Galibert

  • MG

    Maha Ghribi

Links