Back to Main Conference 2002
LREC 2002main
Extracting Information for Automatic Indexing of Multimedia Material
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)
Abstract
This paper discusses our work on information extraction (IE) from multi-lingual, multi-media, multi-genre Language Resources, in a domain where there are many different event types. This work is being carried out in the context of MUMIS, an EU-funded project that aims at the development of basic technology for the creation of a composite index from multiple and multi-lingual sources. Our approach to IE relies on a finite state machinery provided by GATE, a General Architecture for Text Engineering, pipelined with full syntactic analysis and discourse interpretation implemented in Prolog.