LREC 2018 main

Identifying Speakers and Addressees in Dialogues Extracted from Literary Fiction

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/3fbftv65cwoy

Abstract

This paper describes an approach to identifying speakers and addressees in dialogues extracted from literary fiction, along with a dataset annotated for speaker and addressee. The overall purpose is to annotate dialogue interaction between characters in literary corpora, enabling enriched search facilities and the construction of social networks from the corpora. To predict speakers and addressees in a dialogue, we use a sequence labeling approach applied to a given set of characters. We use features relating to the current dialogue, the preceding narrative, and the complete preceding context. The results indicate that even with a small amount of training data, it is possible to build a fairly accurate classifier for speaker and addressee identification across different authors, though the identification of addressees is the more difficult task.
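As an illustration of the social-network use case mentioned in the abstract, the following minimal sketch shows how speaker and addressee annotations could be aggregated into a directed, weighted interaction network. It is not the authors' implementation: the function name `build_interaction_network`, the tuple-based input format, and the character names are all assumptions made for this example.

```python
from collections import Counter


def build_interaction_network(utterances):
    """Count directed speaker -> addressee edges.

    `utterances` is assumed to be an iterable of (speaker, addressee)
    pairs, one per dialogue line; this format is hypothetical.
    """
    edges = Counter()
    for speaker, addressee in utterances:
        # Skip lines where either role is left unannotated.
        if speaker is None or addressee is None:
            continue
        edges[(speaker, addressee)] += 1
    return edges


# Invented annotations, for illustration only.
annotated = [
    ("Anna", "Erik"),
    ("Erik", "Anna"),
    ("Anna", "Erik"),
    ("Anna", None),
]
network = build_interaction_network(annotated)
```

Each key in `network` is a directed edge between two characters, and its count can serve as an edge weight when the network is drawn or analysed.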

Details

Paper ID
lrec2018-main-131
Pages
N/A
BibKey
ek-etal-2018-identifying
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7-12 May 2018

Authors

  • Adam Ek

  • Mats Wirén

  • Robert Östling

  • Kristina N. Björkenstam

  • Gintarė Grigonytė

  • Sofia Gustafson Capková

Links