Back to Main Conference 2026
LREC 2026main

Temporal Expression Recognition in Legal Transcripts

Proceedings of the Fifteenth Language Resources and Evaluation Conference (LREC 2026)

DOI:10.63317/5n7bd6gxobss

Abstract

In litigation, trial transcripts provide verbatim records of witness testimony, primarily given in response to attorney questioning. To effectively analyze these transcripts, lawyers must often reconstruct events in chronological order—a task that begins with identifying dates associated with testified facts. This paper introduces two datasets for temporal expression extraction from legal transcripts: a primary dataset derived from a lengthy 1995 U.S. criminal trial, and a smaller robustness-testing dataset drawn from seven other legal proceedings. We evaluate semi-supervised approaches for date entity recognition, fine-tuning neural models on weakly labeled training data, and benchmarking them against both small and large language models. Our best-performing models achieve 83% F1-score on the primary dataset (FLAIR rule-modified) and 72% F1-score on the cross-domain, small test set (BERT-cased). These results, alongside our annotated datasets and corresponding experiments, provide a foundation for developing robust date extraction and temporal ordering tools for speech-derived legal text. Moreover, we identify unique challenges for state-of-the-art NER models on legal transcripts, including legal terminology and multiple anchor date resolution.

Details

Paper ID
lrec2026-main-478
Pages
pp. 6022-6037
BibKey
goldstein-etal-2026-temporal
Editors
Stelios Piperidis, Núria Bel, Henk van den Heuvel, Nancy Ide, Simon Krek, Antonio Toral
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-493814-49-4
Conference
The Fifteenth Language Resources and Evaluation Conference (LREC 2026)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • EG

    Elizabeth J. Goldstein

  • MB

    Maria Berger

Links