Back to Main Conference 2014
LREC 2014main

Morphological parsing of Swahili using crowdsourced lexical resources

Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014)

DOI:10.63317/3o7gcvqkiaid

Abstract

We describe a morphological analyzer for the Swahili language, written in an extension of XFST/LEXC intended for the easy declaration of morphophonological patterns and importation of lexical resources. Our analyzer was supplemented extensively with data from the Kamusi Project (kamusi.org), a user-contributed multilingual dictionary. Making use of this resource allowed us to achieve wide lexical coverage quickly, but the heterogeneous nature of user-contributed content also poses some challenges when adapting it for use in an expert system.

Details

Paper ID
lrec2014-main-686
Pages
pp. 3333-3339
BibKey
littell-etal-2014-morphological
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-8-4
Conference
Ninth International Conference on Language Resources and Evaluation
Location
Reykjavik, Iceland
Date
26 May 2014 31 May 2014

Authors

  • PL

    Patrick Littell

  • KP

    Kaitlyn Price

  • LL

    Lori Levin

Links