Acquisition of Linguistic Patterns for Knowledge-based Information Extraction
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)
Abstract
In this paper we present a new method for the automatic acquisition of linguistic patterns for Information Extraction, as implemented in the CICERO system. Our approach combines lexico-semantic information available from the WordNet database with collocation data extracted from training corpora. Because the WordNet information is open-domain and large collections of texts are readily available, our method can be easily ported to open-domain Information Extraction.