Back to Main Conference 2016
LREC 2016main

Can Topic Modelling benefit from Word Sense Information?

Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016)

DOI:10.63317/44wgcy4fukwf

Abstract

This paper proposes a new topic model that exploits word sense information in order to discover less redundant and more informative topics. Word sense information is obtained from WordNet and the discovered topics are groups of synsets, instead of mere surface words. A key feature is that all the known senses of a word are considered, with their probabilities. Alternative configurations of the model are described and compared to each other and to LDA, the most popular topic model. However, the obtained results suggest that there are no benefits of enriching LDA with word sense information.

Details

Paper ID
lrec2016-main-540
Pages
pp. 3387-3393
BibKey
ferrugento-etal-2016-topic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-9-1
Conference
Tenth International Conference on Language Resources and Evaluation
Location
Portorož, Slovenia
Date
23 May 2016 28 May 2016

Authors

  • AF

    Adriana Ferrugento

  • HO

    Hugo Gonçalo Oliveira

  • AA

    Ana Alves

  • FR

    Filipe Rodrigues

Links