Transferring Coreference Chains through Word Alignment
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
This paper investigates the problem of automatically annotating resources with NP coreference information. We use an English-Romanian parallel corpus to transfer coreference chains, through word alignment, from the English part to the Romanian part of the corpus. The results show that we can detect Romanian referential expressions and coreference chains with over 80% F-measure. Using our method as a preprocessing step, followed by manual correction, is therefore worthwhile as part of an annotation effort to create a large Romanian corpus with coreference information.
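The general idea of projecting coreference chains through word alignment can be illustrated with a minimal sketch. The abstract does not describe the paper's actual projection procedure, so everything below is an assumption made for illustration: the function names `project_span` and `project_chains` are hypothetical, mentions are represented as inclusive token-index spans, the alignment is a mapping from English token indices to sets of Romanian token indices, and a projected mention is taken to be the smallest Romanian span covering all aligned tokens.

```python
from typing import Dict, List, Optional, Set, Tuple

# Assumption: a mention is an inclusive (start, end) pair of token indices.
Span = Tuple[int, int]
# Assumption: alignment maps each source token index to its aligned target indices.
Alignment = Dict[int, Set[int]]


def project_span(span: Span, alignment: Alignment) -> Optional[Span]:
    """Project a source span to the smallest target span covering its aligned tokens."""
    start, end = span
    targets: Set[int] = set()
    for i in range(start, end + 1):
        targets |= alignment.get(i, set())
    if not targets:
        return None  # no token of the mention is aligned, so it cannot be projected
    return (min(targets), max(targets))


def project_chains(chains: List[List[Span]], alignment: Alignment) -> List[List[Span]]:
    """Project every mention of every chain, dropping mentions without alignment."""
    projected: List[List[Span]] = []
    for chain in chains:
        mentions = []
        for mention in chain:
            target = project_span(mention, alignment)
            if target is not None:
                mentions.append(target)
        if mentions:
            projected.append(mentions)
    return projected


# Illustrative (invented) sentence pair, not taken from the paper's corpus:
# EN: The(0) president(1) said(2) he(3) would(4) resign(5)
# RO: Președintele(0) a(1) spus(2) că(3) el(4) va(5) demisiona(6)
alignment = {0: {0}, 1: {0}, 2: {1, 2}, 3: {4}, 4: {5}, 5: {6}}
english_chains = [[(0, 1), (3, 3)]]  # "The president" ... "he"
romanian_chains = project_chains(english_chains, alignment)
print(romanian_chains)
```

In this sketch the two-token English mention "The president" collapses onto the single Romanian token "Președintele", since Romanian attaches the definite article as a suffix. Noisy alignments can yield spans that are too wide or missing altogether, which is consistent with the abstract's framing of the method as a preprocessing step followed by manual correction.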