LREC 2022 Main Conference

Subjective Text Complexity Assessment for German

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/5dxoxk454yfy

Abstract

For various reasons, many people find text difficult to read and understand, especially when its language is too complex. Providing text suited to a target audience therefore requires measuring its complexity. In this paper, we describe subjective experiments to assess the readability of German text. We compile a new corpus of sentences provided by a German IT service provider. The sentences are annotated with subjective complexity ratings by two groups of participants: experts and non-experts in that text domain. We then extract an extensive set of linguistically motivated features that are expected to interact with complexity perception. We show that a linear regression model using a subset of these features can be a very good predictor of text complexity.

Details

Paper ID
lrec2022-main-074
Pages
pp. 707-714
BibKey
seiffe-etal-2022-subjective
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 to 25 June 2022

Authors

  • Laura Seiffe
  • Fares Kallel
  • Sebastian Möller
  • Babak Naderi
  • Roland Roller

Links