Back to Main Conference 2022
LREC 2022main

Construction of a Quality Estimation Dataset for Automatic Evaluation of Japanese Grammatical Error Correction

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/2skmcvitv7by

Abstract

In grammatical error correction (GEC), automatic evaluation is considered as an important factor for research and development of GEC systems. Previous studies on automatic evaluation have shown that quality estimation models built from datasets with manual evaluation can achieve high performance in automatic evaluation of English GEC. However, quality estimation models have not yet been studied in Japanese, because there are no datasets for constructing quality estimation models. In this study, therefore, we created a quality estimation dataset with manual evaluation to build an automatic evaluation model for Japanese GEC. By building a quality estimation model using this dataset and conducting a meta-evaluation, we verified the usefulness of the quality estimation model for Japanese GEC.

Details

Paper ID
lrec2022-main-596
Pages
pp. 5565-5572
BibKey
suzuki-etal-2022-construction
Editors
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis2020
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 - 25 June 2022

Authors

  • DS

    Daisuke Suzuki

  • YT

    Yujin Takahashi

  • IY

    Ikumi Yamashita

  • TA

    Taichi Aida

  • TH

    Tosho Hirasawa

  • MN

    Michitaka Nakatsuji

  • MM

    Masato Mita

  • MK

    Mamoru Komachi

Links