HomeLREC 2026WorkshopsLT4HALAlrec2026-ws-lt4hala-23
Back to LT4HALA 2026
LREC 2026workshop

Overview of EvaHan2026: The First International Evaluation of Ancient Chinese OCR and Layout Analysis

Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026

DOI:10.63317/3a35qx3n2fg9

Abstract

Ancient Chinese documents are vital for historical research, necessitating high-precision character recognition and layout analysis for digitization. This paper introduces EvaHan2026, the inaugural international shared task for simultaneous optical character recognition and layout parsing of ancient texts. The evaluation framework comprehensively assesses model performance across diverse calligraphic styles and complex structures, including main body text, interlinear annotations, and illustrations. Among thirteen participating teams, four successfully completed all tasks within the closed track. Experimental results reveal that character recognition accuracy reached 97.36% on engraved texts (Test Set A) and 95.71% on handwritten texts (Test Set C) when accounting for character variants. For layout recognition in complex layouts (Test Set B), the best team achieved a peak mean Average Precision (mAP) of 59.41% and an Intersection over Union (loU) of 76.38%. Our analysis indicates that calligraphic variability, layout density, and character variants significantly modulate system performance. Consequently, enhancing robustness within complex layouts and developing synergistic models that integrate textual and structural information remain primary challenges for intelligent interpretation of ancient writings .

Details

Paper ID
lrec2026-ws-lt4hala-23
Pages
pp. 244-262
BibKey
wang-etal-2026-overview
Editors
Rachele Sprugnoli, Marco Passarotti
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Fourth Workshop on Language Technologies for Historical and Ancient Languages (LT4HALA 2026) @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • DW

    Dongbo Wang

  • DZ

    Dongmei Zhu

  • JL

    Jieqiong Li

  • CL

    Chang Liu

  • RW

    Ruifeng Wu

  • FY

    Fan Yang

  • ZY

    Zhu Yue

  • XG

    Xin Gao

  • ZZ

    Zhao Zhixiao

  • XZ

    Xue Zhao

  • ZW

    Zhongzheng Wu

  • LL

    Liu Liu

  • LB

    Li Bin

Links