Back to Main Conference 2004
LREC 2004main

Categorizing Web Pages as a Preprocessing Step for Information Extraction

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

DOI:10.63317/2uvbfc8ovrtu

Abstract

At present, information systems combining crawling and information extraction (IE) technologies acquire a lot of research and industrial interest. In this paper, we present an algorithm that exploits techniques for unsupervised IE pattern acquisition in order to facilitate identification of web pages containing information relevant to the IE task.

Details

Paper ID
lrec2004-main-328
Pages
N/A
BibKey
pekar-etal-2004-categorizing
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
2-9517408-1-6
Conference
Fourth International Conference on Language Resources and Evaluation
Location
Lisbon, Portugal
Date
26 May 2004 28 May 2004

Authors

  • VP

    Viktor Pekar

  • RE

    Richard Evans

  • RM

    Ruslan Mitkov

Links