Back to Main Conference 2022
LREC 2022main

StyleKQC: A Style-Variant Paraphrase Corpus for Korean Questions and Commands

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/54o84q3omjwi

Abstract

Paraphrasing is often performed with less concern for controlled style conversion. Especially for questions and commands, style-variant paraphrasing can be crucial in tone and manner, which also matters with industrial applications such as dialog systems. In this paper, we attack this issue with a corpus construction scheme that simultaneously considers the core content and style of directives, namely intent and formality, for the Korean language. Utilizing manually generated natural language queries on six daily topics, we expand the corpus to formal and informal sentences by human rewriting and transferring. We verify the validity and industrial applicability of our approach by checking the adequate classification and inference performance that fit with conventional fine-tuning approaches, at the same time proposing a supervised formality transfer task.

Details

Paper ID
lrec2022-main-771
Pages
pp. 7122-7128
BibKey
cho-etal-2022-stylekqc
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • WC

    Won Ik Cho

  • SM

    Sangwhan Moon

  • JK

    Jongin Kim

  • SK

    Seokmin Kim

  • NK

    Nam Soo Kim

Links