HomeLREC 2020WorkshopsCOMPUTERMlrec2020-ws-computerm-09
Back to COMPUTERM 2020
LREC 2020workshop

Towards Automatic Thesaurus Construction and Enrichment.

Proceedings of the 6th International Workshop on Computational Terminology

DOI:10.63317/5bkkh3ezp26v

Abstract

Thesaurus construction with minimum human efforts often relies on automatic methods to discover terms and their relations. Hence, the quality of a thesaurus heavily depends on the chosen methodologies for: (i) building its content (terminology extraction task) and (ii) designing its structure (semantic similarity task). The performance of the existing methods on automatic thesaurus construction is still less accurate than the handcrafted ones of which is important to highlight the drawbacks to let new strategies build more accurate thesauri models. In this paper, we will provide a systematic analysis of existing methods for both tasks and discuss their feasibility based on an Italian Cybersecurity corpus. In particular, we will provide a detailed analysis on how the semantic relationships network of a thesaurus can be automatically built, and investigate the ways to enrich the terminological scope of a thesaurus by taking into account the information contained in external domain-oriented semantic sets.

Details

Paper ID
lrec2020-ws-computerm-09
Pages
pp. 62-71
BibKey
hazem-etal-2020-towards
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the 6th International Workshop on Computational Terminology
Location
undefined, undefined
Date
11 May 2020 16 May 2020

Authors

  • AH

    Amir Hazem

  • BD

    Beatrice Daille

  • LC

    Lanza Claudia

Links