Back to Main Conference 2016
LREC 2016main

Ambiguity Diagnosis for Terms in Digital Humanities

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/4uwf5uh8a5fa

Abstract

Among all researches dedicating to terminology and word sense disambiguation, little attention has been devoted to the ambiguity of term occurrences. If a lexical unit is indeed a term of the domain, it is not true, even in a specialised corpus, that all its occurrences are terminological. Some occurrences are terminological and other are not. Thus, a global decision at the corpus level about the terminological status of all occurrences of a lexical unit would then be erroneous. In this paper, we propose three original methods to characterise the ambiguity of term occurrences in the domain of social sciences for French. These methods differently model the context of the term occurrences: one is relying on text mining, the second is based on textometry, and the last one focuses on text genre properties. The experimental results show the potential of the proposed approaches and give an opportunity to discuss about their hybridisation.

Details

Paper ID
lrec2016-main-690
Pages
pp. 4353-4359
BibKey
daille-etal-2016-ambiguity
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • BD

    Béatrice Daille

  • EJ

    Evelyne Jacquey

  • GL

    Gaël Lejeune

  • LM

    Luis Felipe Melo

  • YT

    Yannick Toussaint

Links