HomeLREC 2022WorkshopsISAlrec2022-ws-isa-04
Back to ISA 2022
LREC 2022workshop

Levels of Non-Fictionality in Fictional Texts

Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation within LREC2022

DOI:10.63317/3pgpwbaaqcfb

Abstract

The annotation and automatic recognition of non-fictional discourse within a text is an important, yet unresolved task in literary research. While non-fictional passages can consist of several clauses or sentences, we argue that 1) an entity-level classification of fictionality and 2) the linking of Wikidata identifiers can be used to automatically identify (non-)fictional discourse. We query Wikidata and DBpedia for relevant information about a requested entity as well as the corresponding literary text to determine the entity’s fictionality status and assign a Wikidata identifier, if unequivocally possible. We evaluate our methods on an exemplary text from our diachronic literary corpus, where our methods classify 97% of persons and 62% of locations correctly as fictional or real. Furthermore, 75% of the resolved persons and 43% of the resolved locations are resolved correctly. In a quantitative experiment, we apply the entity-level fictionality tagger to our corpus and conclude that more non-fictional passages can be identified when information about real entities is available.

Details

Paper ID
lrec2022-ws-isa-04
Pages
pp. 27-32
BibKey
barth-etal-2022-levels
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 18th Joint ACL - ISO Workshop on Interoperable Semantic Annotation within LREC2022
Location
undefined, undefined
Date
20 June 2022 25 June 2022

Authors

  • FB

    Florian Barth

  • HV

    Hanna Varachkina

  • TD

    Tillmann Dönicke

  • LG

    Luisa Gödeke

Links