Back to Main Conference 2002
LREC 2002main

Information Extraction from Text Corpora: Using Filters on Collocation Sets

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/46r8n22mzjhq

Abstract

This paper describes the application of filtering techniques to collocation sets calculated for very large text corpora. Additional information like patterns, grammatical information, subject areas and numerical values associated with the collocations are used to identify collocations with given semantic structure. Various examples and different techniques for applying such filters are described. We also give several examples of practical applications for this type of information extraction.

Details

Paper ID
lrec2002-main-299
Pages
N/A
BibKey
heyer-etal-2002-information
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • GH

    Gerhard Heyer

  • UQ

    Uwe Quasthoff

  • CW

    Christian Wolff

Links