Back to Main Conference 2008
LREC 2008main
An Approach to Modeling Heterogeneous Resources for Information Extraction
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC 2008)
Abstract
In this paper, we describe an approach that aims to model heterogeneous resources for information extraction. Document is modeled in graph representation that enables better understanding of multi-media document and its structure which ultimately could result better cross-media information extraction. We also describe our proposed algorithm that segment document-based on the document modeling approach we described in this paper.