Back to Main Conference 2014
LREC 2014main

Large Scale Arabic Error Annotation: Guidelines and Framework

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/5mvsos89i9i9

Abstract

We present annotation guidelines and a web-based annotation framework developed as part of an effort to create a manually annotated Arabic corpus of errors and corrections for various text types. Such a corpus will be invaluable for developing Arabic error correction tools, both for training models and as a gold standard for evaluating error correction algorithms. We summarize the guidelines we created. We also describe issues encountered during the training of the annotators, as well as problems that are specific to the Arabic language that arose during the annotation process. Finally, we present the annotation tool that was developed as part of this project, the annotation pipeline, and the quality of the resulting annotations.

Details

Paper ID
lrec2014-main-721
Pages
N/A
BibKey
zaghouani-etal-2014-large
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 31 May 2014

Authors

  • WZ

    Wajdi Zaghouani

  • BM

    Behrang Mohit

  • NH

    Nizar Habash

  • OO

    Ossama Obeid

  • NT

    Nadi Tomeh

  • AR

    Alla Rozovskaya

  • NF

    Noura Farra

  • SA

    Sarah Alkuhlani

  • KO

    Kemal Oflazer

Links