Generating an Arabic Full-form Lexicon for Bidirectional Morphology Lookup
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)
Abstract
We describe the generation of an Arabic full-form lexicon and its conversion into a two-level finite-state transducer (FST) for morphological analysis and generation. Morphological lookup is implemented by representing the relevant data as an FST, a format for which generic implementations exist that ease integration into larger natural language processing systems. We show that our encoding is feasible and that it supports the analysis of both vowelled and unvowelled Arabic words.
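To illustrate what bidirectional lookup over a full-form lexicon can look like, the following is a minimal sketch and not the encoding used in the paper. It builds a toy acyclic transducer whose arcs pair an analysis-side symbol with a surface-side character, and then traverses it in either direction. The class `FullFormFST`, the tag strings such as `+Verb` and `+3MascSg`, the two sample entries, and the choice to store an unvowelled copy of each surface form are all assumptions made for the illustration.

```python
import unicodedata
from collections import defaultdict

EPS = ""  # empty symbol, used to pad the shorter side of an entry


def strip_diacritics(word):
    """Drop combining marks (the Arabic short-vowel signs are combining)."""
    return "".join(c for c in word if not unicodedata.combining(c))


class FullFormFST:
    """Toy acyclic transducer: analysis symbols on one side, surface on the other."""

    def __init__(self):
        self.arcs = defaultdict(list)  # state -> [(upper, lower, next_state)]
        self.finals = set()
        self.n_states = 1              # state 0 is the start state

    def add(self, analysis, surface):
        """Add one full form; `analysis` is a tuple of (multi-character) symbols."""
        upper, lower = list(analysis), list(surface)
        n = max(len(upper), len(lower))
        upper += [EPS] * (n - len(upper))
        lower += [EPS] * (n - len(lower))
        state = 0
        for u, l in zip(upper, lower):
            for u2, l2, nxt in self.arcs[state]:
                if (u2, l2) == (u, l):   # reuse an existing arc (shared prefix)
                    state = nxt
                    break
            else:
                nxt = self.n_states
                self.n_states += 1
                self.arcs[state].append((u, l, nxt))
                state = nxt
        self.finals.add(state)

    def _lookup(self, symbols, side):
        """Match `symbols` on one side and collect the other side's output."""
        results, stack = set(), [(0, 0, ())]
        while stack:
            state, pos, out = stack.pop()
            if pos == len(symbols) and state in self.finals:
                results.add("".join(out))
            for upper, lower, nxt in self.arcs.get(state, ()):
                inp, outp = (upper, lower) if side == "upper" else (lower, upper)
                new_out = out + ((outp,) if outp else ())
                if inp == EPS:                       # consumes no input
                    stack.append((nxt, pos, new_out))
                elif pos < len(symbols) and symbols[pos] == inp:
                    stack.append((nxt, pos + 1, new_out))
        return results

    def analyze(self, surface):
        return self._lookup(list(surface), "lower")

    def generate(self, analysis):
        return self._lookup(list(analysis), "upper")


# Hypothetical sample entries (tags and lemma spellings are illustrative only).
KATABA = "\u0643\u064e\u062a\u064e\u0628\u064e"  # vowelled "kataba", "he wrote"
KUTUB = "\u0643\u064f\u062a\u064f\u0628"         # vowelled "kutub", "books"
VERB = ("kataba", "+Verb", "+Perf", "+3MascSg")
NOUN = ("kitAb", "+Noun", "+Pl")

fst = FullFormFST()
for analysis, form in [(VERB, KATABA), (NOUN, KUTUB)]:
    fst.add(analysis, form)                    # vowelled full form
    fst.add(analysis, strip_diacritics(form))  # unvowelled copy (assumption)

print(fst.analyze(KATABA))                    # vowelled input: one analysis
print(fst.analyze(strip_diacritics(KATABA)))  # unvowelled input: ambiguous
print(fst.generate(NOUN))                     # generation from an analysis
```

In this sketch the vowelled form yields a single analysis, while the unvowelled string shared by both entries yields two, which is the kind of ambiguity an analyser for unvowelled text has to return. A production system would more likely compile the lexicon with an existing finite-state toolkit than hand-roll the traversal.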