
AlSaifTeam at AR-MS NAKBA-NLP 2026: Building Expert-Quality Ground Truth for Arabic Handwritten Manuscripts

Proceedings of the 2nd International Workshop on Nakba Narratives as Language Resources @ LREC 2026

DOI:10.63317/5pcgw3fzbc6i

Abstract

This paper describes our participation in Subtask 1 of the NAKBA NLP 2026 Arabic Manuscript Understanding Shared Task, which focuses on the manual creation of expert-quality, line-level transcriptions for Arabic handwritten manuscripts. To ensure reliable ground truth, we adopt a protocol-driven methodology based on fixed transcription rules, collaborative verification, and confidence-based quality control. The proposed approach aims to improve consistency, reduce annotation bias, and support the creation of trustworthy benchmark resources for future Arabic OCR and HTR research.

Keywords: Arabic handwritten manuscripts, ground truth construction, manual transcription, handwritten text recognition, optical character recognition, benchmark enrichment

Details

Paper ID
lrec2026-ws-nakbanlp-39
Pages
pp. 258-260
BibKey
alsaif-etal-2026-alsaifteam
Editors
Mustafa Jarrar, Mo El-Haj, Amal Haddad, Serin Atiani, Shadi Abudalfa, Terry Regier, Paul Rayson, Khalil Sima’an, Camille Mansour
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd International Workshop on Nakba Narratives as Language Resources @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • Joud Fahad AlSaif

  • Alhasan Mohammad Hamood

  • Jana Mohammad Alseed
