Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

DOI:10.63317/2qcsskyrb6m5

Abstract

We introduce the Arabic Data Science Toolkit (ADST), a framework for Arabic language feature extraction designed for data scientists who may not be familiar with Arabic or natural language processing. The functions in the toolkit allow data scientists to extend their algorithms beyond lexical or statistical methods and to leverage Arabic-specific linguistic and stylistic features, enabling their systems to reach the performance levels they might achieve on languages with more resources, or on languages with which they are more familiar.

Details

Paper ID
lrec2018-main-198
Pages
N/A
BibKey
rodrigues-etal-2018-arabic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
979-10-95546-00-9
Conference
Eleventh International Conference on Language Resources and Evaluation
Location
Miyazaki, Japan
Date
7 May 2018 to 12 May 2018

Authors

  • Paul Rodrigues

  • Valerie Novak

  • C. Anton Rytting

  • Julie Yelle

  • Jennifer Boutz

Links