Back to Main Conference 2018
LREC 2018main

Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/4u4qcrjt5isy

Abstract

Delta measures are a well-established and popular family of authorship attribution methods, especially for literary texts. N-gram tracing is a novel method for authorship attribution designed for very short texts, which has its roots in forensic linguistics. We evaluate the performance of both methods in a series of experiments on English, French and German literary texts, in order to investigate the relationship between authorship attribution accuracy and text length as well as the composition of the comparison corpus. Our results show that, at least in our setting, both methods require relatively long texts and are furthermore highly sensitive to the choice of authors and texts in the comparison corpus.

Details

Paper ID
lrec2018-main-523
Pages
N/A
BibKey
proisl-etal-2018-delta
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 12 May 2018

Authors

  • TP

    Thomas Proisl

  • SE

    Stefan Evert

  • FJ

    Fotis Jannidis

  • CS

    Christof Schöch

  • LK

    Leonard Konle

  • SP

    Steffen Pielström

Links