Back to Main Conference 2012
LREC 2012main
An NLP Curator (or: How I Learned to Stop Worrying and Love NLP Pipelines)
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)
Abstract
Natural Language Processing continues to grow in popularity in a range of research and commercial applications, yet managing the wide array of potential NLP components remains a difficult problem. This paper describes Curator, an NLP management framework designed to address some common problems and inefficiencies associated with building NLP process pipelines; and Edison, an NLP data structure library in Java that provides streamlined interactions with Curator and offers a range of useful supporting functionality.