Back to Main Conference 2006
LREC 2006main
The Impact of Annotation on the Performance of Protein Tagging in Biomedical Text
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC 2006)
Abstract
In this paper we discuss five different corpora annotated forprotein names. We present several within- and cross-dataset proteintagging experiments showing that different annotation schemes severelyaffect the portability of statistical protein taggers. By means of adetailed error analysis we identify crucial annotation issues thatfuture annotation projects should take into careful consideration.