LREC 2022 main

A Romanization System and WebMAUS Aligner for Arabic Varieties

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4gt3xhvgrapx

Abstract

This paper presents the results of an ongoing collaboration to develop an Arabic variety-independent romanization system that aims to homogenize and simplify the romanization of the Arabic script. It also introduces an Arabic variety-independent forced-alignment service that is free to use and fully integrated within the WebMAUS services. We present the rationale for developing such a system, highlighting the need for a detailed romanization system with graphemes corresponding to the phonemic short and long vowels/consonants in Arabic varieties. We describe how the acoustic model was created, followed by several hands-on recipes for applying the forced-alignment web service either online or programmatically. Finally, we discuss some of the issues we faced during the development of the system.

Details

Paper ID
lrec2022-main-789
Pages
pp. 7269-7276
BibKey
al-tamimi-etal-2022-romanization
Editors
Nicoletta Calzolari, Frédéric Béchet, Philippe Blache, Khalid Choukri, Christopher Cieri, Thierry Declerck, Sara Goggi, Hitoshi Isahara, Bente Maegaard, Joseph Mariani, Hélène Mazo, Jan Odijk, Stelios Piperidis
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 - 25 June 2022

Authors

  • Jalal Al-Tamimi
  • Florian Schiel
  • Ghada Khattab
  • Navdeep Sokhey
  • Djegdjiga Amazouz
  • Abdulrahman Dallak
  • Hajar Moussa

Links