Back to Main Conference 2006
LREC 2006main
Annotating COMPARA, a Grammar-aware Parallel Corpus
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
In this paper we describe the annotation of COMPARA, currently the largest post-edited parallel corpora which include Portuguese. We describe the motivation, the results so far, and the way the corpus is being annotated. We also provide the first grounded results about syntactical ambiguity in Portuguese. Finally, we discuss some interesting problems in this connection.