Putting the Dutch PAROLE Corpus to Work

Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC 2004)

Abstract

We describe the development of the retrieval application for the Dutch PAROLE Corpus. Compared with the other corpora developed by INL, the PAROLE Corpus is encoded with more extensive metadata, conforming to the TEI standard for text encoding. A search engine and a web-based user interface, both newly developed by INL, let the user explore the corpus not only at the level of the text, but also at the level of the metadata, or through a combination of the two. Given our experience with corpus retrieval, we did not follow the complete system development cycle but used an alternative method instead.