HomeLREC 2026WorkshopsNAKBANLPlrec2026-ws-nakbanlp-35
Back to NAKBANLP 2026
LREC 2026workshop

Oblevit at AR-MS NAKBA NLP 2026 Subtask 2: Hybrid CNN–BiLSTM–CTC Framework with Linguistic Refinement for Arabic Handwritten Manuscript Recognition

Proceedings of the 2nd International Workshop on Nakba Narratives as Language Resources @ LREC 2026

DOI:10.63317/5nwur655ha5k

Abstract

Arabic handwritten manuscript recognition is challenging due to the cursive nature of the script, dot ambiguity, and document degradation. In this work, we propose an end-to-end OCR system based on a CNN–BiLSTM–CTC architecture. The model extracts visual features, captures sequential dependencies, and performs alignment-free training. Arabic-specific decoding and post-processing techniques are applied to reduce character and spacing errors. Experimental results show competitive performance in recognizing complex handwritten Arabic text.

Details

Paper ID
lrec2026-ws-nakbanlp-35
Pages
pp. 234-238
BibKey
juhaysh-etal-2026-oblevit
Editors
Mustafa Jarrar, Mo El-Haj, Amal Haddad, Serin Atiani, Shadi Abudalfa, Terry Regier, Paul Rayson, Khalil Sima’an, Camille Mansour
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 2nd International Workshop on Nakba Narratives as Language Resources @ LREC 2026
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • RJ

    Reem Juhaysh

  • AA

    Abuelgasim Sami Abusonoun

  • SA

    Sara Ayad

Links