Back to Main Conference 2000
LREC 2000main
The American National Corpus: A Standardized Resource for American English
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC 2000)
Abstract
At the first conference on Language Resources and Evaluation, Granada 1998, Charles Fillmore, Nancy Ide, Daniel Jurafsky, and Catherine Macleod proposed creating an American National Corpus (ANC) that would compare with the British National Corpus (BNC) both in balance and in size (one hundred million words). This paper reports on the progress made over the past two years in launching the project. At present, the ANC project is well underway, with commitments for support and contribution of texts from a number of publishers world-wide.