Back to Main Conference 2026
LREC 2026main

NOVELSUM: Evaluating Long-Form Summary Generation for Historical Scandinavian Novels

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/22upgvjw86b9

Abstract

We study long-form summarization of late-19th-century Danish and Norwegian novels and propose NOVELSUM, an evaluation resource and protocol tailored to literary narrative. We use a curated set of historical novels paired with professional reference summaries to establish baselines with long-document encoder–decoder models and prompt-based large-context LLMs. We evaluate with automatic metrics, expert human judgments, and LLM-as-judge scoring. Our human study identifies evaluation dimensions and literary facets that achieve substantial inter-annotator agreement and align with scholarly expectations. We further analyze reference-free evaluation, showing when it tracks expert trends and where it fails (notably for factual and setting-related criteria), thereby clarifying its utility when gold references or expert readers are unavailable. Our results benchmark long-context and prompted LLM approaches on historical literary prose and offer a practical path for human-grounded and reference-free assessment.

Details

Paper ID
lrec2026-main-780
Pages
pp. 9953-9963
BibKey
allaith-etal-2026-novelsum
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 May 2026 16 May 2026

Authors

  • AA

    Ali Al-Laith

  • AC

    Alexander Conroy

  • KD

    Kirstine Nielsen Degn

  • JB

    Jens Bjerring-Hansen

  • DH

    Daniel Hershcovich

Links