Arabic Data Science Toolkit: An API for Arabic Language Feature Extraction
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
Abstract
We introduce the Arabic Data Science Toolkit (ADST), a framework for Arabic language feature extraction designed for data scientists who may not be familiar with Arabic or with natural language processing. The toolkit's functions allow data scientists to extend their algorithms beyond lexical or statistical methods and to leverage Arabic-specific linguistic and stylistic features. These features can enhance their systems and help them reach the performance levels they might achieve on languages with more resources, or on languages with which they are more familiar.
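To make the idea of Arabic-specific stylistic features concrete, the following is a minimal sketch in Python. It is not ADST's API: the function name `extract_features`, the feature names, and the choice of features (diacritic marks, tatweel elongation, letter and token counts) are assumptions made purely for illustration, relying only on code point ranges from the Unicode Arabic block.

```python
# Illustrative sketch only; the names here are hypothetical and are NOT part of ADST.

TATWEEL = "\u0640"  # elongation character (kashida), often used for emphasis
# Short-vowel and related marks (tashkeel): U+064B (fathatan) through U+0652 (sukun)
DIACRITICS = {chr(cp) for cp in range(0x064B, 0x0653)}


def is_arabic_letter(ch: str) -> bool:
    """True for base Arabic letters in U+0621..U+064A, excluding tatweel."""
    return "\u0621" <= ch <= "\u064A" and ch != TATWEEL


def extract_features(text: str) -> dict:
    """Return a few surface-level stylistic counts for an Arabic string."""
    letters = sum(1 for ch in text if is_arabic_letter(ch))
    diacritics = sum(1 for ch in text if ch in DIACRITICS)
    return {
        "tokens": len(text.split()),          # whitespace-delimited tokens
        "arabic_letters": letters,
        "diacritics": diacritics,
        "tatweel": text.count(TATWEEL),
        # Diacritics per letter; guarded against empty or non-Arabic input
        "diacritic_ratio": diacritics / letters if letters else 0.0,
    }


# Two words written with escapes so the code points are unambiguous:
# the first carries two diacritics, the second contains two tatweel characters.
sample = "\u0643\u0650\u062A\u064E\u0627\u0628 \u062C\u0645\u064A\u0640\u0640\u0644"
print(extract_features(sample))
```

Counts of this kind can be appended to an existing lexical or statistical feature vector, which is the sort of extension the abstract describes; the actual feature set offered by the toolkit may differ.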