Back to Main Conference 2008
LREC 2008main

A Multi-Word Term Extraction Program for Arabic Language

Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)

DOI:10.63317/4eafyvur9wua

Abstract

Terminology extraction commonly includes two steps: identification of term-like units in the texts, mostly multi-word phrases, and the ranking of the extracted term-like units according to their domain representativity. In this paper, we design a multi-word term extraction program for Arabic language. The linguistic filtering performs a morphosyntactic analysis and takes into account several types of variations. The domain representativity is measure thanks to statistical scores. We evalutate several association measures and show that the results we otained are consitent with those obtained for Romance languages.

Details

Paper ID
lrec2008-main-155
Pages
N/A
BibKey
boulaknadel-etal-2008-multi
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-4-0
Conference
Sixth International Conference on Language Resources and Evaluation
Location
Marrakech, Morocco
Date
28 May 2008 30 May 2008

Authors

  • SB

    Siham Boulaknadel

  • BD

    Beatrice Daille

  • DA

    Driss Aboutajdine

Links