Back to Main Conference 2024
LREC-COLING 2024main

DocScript: Document-level Script Event Prediction

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/4vn863mawyo2

Abstract

We present a novel task of document-level script event prediction, which aims to predict the next event given a candidate list of narrative events in long-form documents. To enable this, we introduce DocSEP, a challenging dataset in two new domains - contractual documents and Wikipedia articles, where timeline events may be paragraphs apart and may require multi-hop temporal and causal reasoning. We benchmark existing baselines and present a novel architecture called DocScript to learn sequential ordering between events at the document scale. Our experimental results on the DocSEP dataset demonstrate that learning longer-range dependencies between events is a key challenge and show that contemporary LLMs such as ChatGPT and FlanT5 struggle to solve this task, indicating their lack of reasoning abilities for understanding causal relationships and temporal sequences within long texts.

Details

Paper ID
lrec2024-main-0458
Pages
pp. 5140-5155
BibKey
mathur-etal-2024-docscript
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • PM

    Puneet Mathur

  • VM

    Vlad I. Morariu

  • AG

    Aparna Garimella

  • FD

    Franck Dernoncourt

  • JG

    Jiuxiang Gu

  • RS

    Ramit Sawhney

  • PN

    Preslav Nakov

  • DM

    Dinesh Manocha

  • RJ

    Rajiv Jain

Links