Back to DETERMIT 2024
LREC-COLING 2024workshop

An Approach towards Unsupervised Text Simplification on Paragraph-Level for German Texts

Proceedings of the Workshop on DeTermIt! Evaluating Text Difficulty in a Multilingual Context @ LREC-COLING 2024

DOI:10.63317/2c2wn8pzba6n

Abstract

Text simplification as a research field has received attention in recent years for English and other languages, however, German text simplification techniques are lacking thus far. We present an unsupervised simplification approach for German texts using reinforcement learning (self-critical sequence training). Our main contributions are the adaption of an existing method for English, the selection and creation of German corpora for this task and the customization of rewards for particular aspects of the German language. In our paper, we describe our system and an evaluation, including still present issues and problems due to the complexity of the German language, as well as directions for future research.

Details

Paper ID
lrec2024-ws-determit-08
Pages
pp. 77-89
BibKey
fruth-etal-2024-approach
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Workshop on DeTermIt! Evaluating Text Difficulty in a Multilingual Context @ LREC-COLING 2024
Location
undefined, undefined
Date
20 May 2024 25 May 2024

Authors

  • LF

    Leon Fruth

  • RJ

    Robin Jegan

  • AH

    Andreas Henrich

Links