Back to Main Conference 2022
LREC 2022main

A Romanization System and WebMAUS Aligner for Arabic Varieties

Proceedings of the Thirteenth International Conference on Language Resources and Evaluation (LREC 2022)

DOI:10.63317/4gt3xhvgrapx

Abstract

This paper presents the results of an ongoing collaboration to develop an Arabic variety-independent romanization system that aims to homogenize and simplify the romanization of the Arabic script, and introduces an Arabic variety-independent WebMAUS service offering a free to use forced-alignment service fully integrated within the WebMAUS services. We present the rationale for developing such a system, highlighting the need for a detailed romanization system with graphemes corresponding to the phonemic short and long vowels/consonants in Arabic varieties. We describe how the acoustic model was created, followed by several hands-on recipes for applying the forced alignment webservice either online or programatically. Finally, we discuss some of the issues we faced during the development of the system.

Details

Paper ID
lrec2022-main-789
Pages
pp. 7269-7276
BibKey
al-tamimi-etal-2022-romanization
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
79-10-95546-38-2
Conference
Thirteenth Language Resources and Evaluation Conference
Location
Marseille, France
Date
20 June 2022 25 June 2022

Authors

  • JA

    Jalal Al-Tamimi

  • FS

    Florian Schiel

  • GK

    Ghada Khattab

  • NS

    Navdeep Sokhey

  • DA

    Djegdjiga Amazouz

  • AD

    Abdulrahman Dallak

  • HM

    Hajar Moussa

Links