Back to Main Conference 2002
LREC 2002main
SAM: System for Multi-criteria Text Alignment.
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)
Abstract
The problem of text alignment is to establish the correspondence between subparts of two ore more translations or versions of the same document. Most of the methods used in alignment are based on the statistical analysis of word or character frequencies or of string occurrences. In order to achieve more accurate results, other methods have incorporated some structural properties of the documents as further criteria.