Back to Main Conference 2002
LREC 2002main

Syntactic Analysis in the Spoken Dutch Corpus (CGN)

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/5h535agjtv8p

Abstract

The paper describes the syntactic annotation of the Spoken Dutch Corpus ("Corpus Gesproken Nederlands" or CGN), the Dutch-Flemish project (1998-2003) aiming at the collection, description and annotation of ten million words of spoken Dutch. In the first part, the background of the parsing strategy is discussed, as well as some details concerning the actual implementation of the parsing process. The second part discusses some examples of practical applications of the result of the parsing process.

Details

Paper ID
lrec2002-main-071
Pages
N/A
BibKey
van-der-wouden-etal-2002-syntactic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • Tv

    Ton van der Wouden

  • HH

    Heleen Hoekstra

  • MM

    Michael Moortgat

  • BR

    Bram Renmans

  • IS

    Ineke Schuurman

Links