Back to Main Conference 2022
LREC 2022main

QA4IE: A Quality Assurance Tool for Information Extraction

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4a7d3jbzdwzt

Abstract

Quality assurance (QA) is an essential though underdeveloped part of the data annotation process. Although QA is supported to some extent in existing annotation tools, comprehensive support for QA is not standardly provided. In this paper we contribute QA4IE, a comprehensive QA tool for information extraction, which can (1) detect potential problems in text annotations in a timely manner, (2) accurately assess the quality of annotations, (3) visually display and summarize annotation discrepancies among annotation team members, (4) provide a comprehensive statistics report, and (5) support viewing of annotated documents interactively. This paper offers a competitive analysis comparing QA4IE and other popular annotation tools and demonstrates its features, usage, and effectiveness through a case study. The Python code, documentation, and demonstration video are available publicly at https://github.com/CC-RMD-EpiBio/QA4IE.

Details

Paper ID
lrec2022-main-478
Pages
pp. 4497-4503
BibKey
silva-etal-2022-qa4ie
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • RS

    Rafael Jimenez Silva

  • KG

    Kaushik Gedela

  • AM

    Alex Marr

  • BD

    Bart Desmet

  • CR

    Carolyn Rose

  • CZ

    Chunxiao Zhou

Links