Extending HeidelTime for Temporal Expressions Referring to Historic Dates

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/2ojpcg227dtv

Abstract

Research on temporal tagging has received a lot of attention in recent years. However, most of the work focuses on processing news-style documents. Thus, references to historic dates are often not handled well by temporal taggers, although they occur frequently in narrative-style documents about history, e.g., in many Wikipedia articles. In this paper, we present the AncientTimes corpus, which contains documents about different historic time periods in eight languages and in which we manually annotated temporal expressions. Based on this corpus, we explain the challenges of temporal tagging of documents about history. Furthermore, we use the corpus to extend our multilingual, cross-domain temporal tagger HeidelTime to extract and normalize temporal expressions referring to historic dates, and to demonstrate HeidelTime’s new capabilities. Both the AncientTimes corpus and the new HeidelTime version are made publicly available.

Details

Paper ID
lrec2014-main-655
Pages
pp. 2390-2397
BibKey
strotgen-etal-2014-extending
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26-31 May 2014

Authors

  • Jannik Strötgen
  • Thomas Bögel
  • Julian Zell
  • Ayser Armiti
  • Tran Van Canh
  • Michael Gertz
