HomeLREC 2022WorkshopsREADIlrec2022-ws-readi-2
Back to READI 2022
LREC 2022workshop

Agree to Disagree: Exploring Subjectivity in Lexical Complexity

Proceedings of the 2nd Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI) within the 13th Language Resources and Evaluation Conference

DOI:10.63317/5gymz4nty8sm

Abstract

Subjective factors affect our familiarity with different words. Our education, mother tongue, dialect or social group all contribute to the words we know and understand. When asking people to mark words they understand some words are unanimously agreed to be complex, whereas other annotators universally disagree on the complexity of other words. In this work, we seek to expose this phenomenon and investigate the factors affecting whether a word is likely to be subjective, or not. We investigate two recent word complexity datasets from shared tasks. We demonstrate that subjectivity is present and describable in both datasets. Further we show results of modelling and predicting the subjectivity of the complexity annotations in the most recent dataset, attaining an F1-score of 0.714.

Details

Paper ID
lrec2022-ws-readi-2
Pages
pp. 9-16
BibKey
shardlow-2022-agree
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd Workshop on Tools and Resources to Empower People with REAding DIfficulties (READI) within the 13th Language Resources and Evaluation Conference
Location
undefined, undefined
Date
20 June 2022 25 June 2022

Authors

  • MS

    Matthew Shardlow

Links