Back to Main Conference 2002
LREC 2002main

Experiments in Topic Detection

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/5agbhnncc4sh

Abstract

Dividing documents into topically-coherent units and discovering their topic might have many uses. We present a system that proceeds in two steps: (1) the input text is segmented at places where there is a probable topic shift, (2) lexical chains are extracted from each segment as indicators of its topic. Two implementations, based on public domain resources, are presented: one based on WordNet and the second one based on Roget's thesaurus. An evaluation of the algorithm shows that lexical chains are acceptable as topic indicator with $44.5%$ of precision and $63.8%$ of recall.

Details

Paper ID
lrec2002-main-053
Pages
N/A
BibKey
chali-2002-experiments
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • YC

    Yllias Chali

Links