Back to Main Conference 2022
LREC 2022main

Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/2skmcvitv7by

Abstract

In grammatical error correction (GEC), automatic evaluation is considered as an important factor for research and development of GEC systems. Previous studies on automatic evaluation have shown that quality estimation models built from datasets with manual evaluation can achieve high performance in automatic evaluation of English GEC. However, quality estimation models have not yet been studied in Japanese, because there are no datasets for constructing quality estimation models. In this study, therefore, we created a quality estimation dataset with manual evaluation to build an automatic evaluation model for Japanese GEC. By building a quality estimation model using this dataset and conducting a meta-evaluation, we verified the usefulness of the quality estimation model for Japanese GEC.

Details

Paper ID
lrec2022-main-596
Pages
pp. 5565-5572
BibKey
suzuki-etal-2022-construction
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • DS

    Daisuke Suzuki

  • YT

    Yujin Takahashi

  • IY

    Ikumi Yamashita

  • TA

    Taichi Aida

  • TH

    Tosho Hirasawa

  • MN

    Michitaka Nakatsuji

  • MM

    Masato Mita

  • MK

    Mamoru Komachi

Links