Back to Main Conference 2002
LREC 2002main

Linguistic and Computational Problems for the Creation of an Italian Children’s Corpus of Spoken Language

Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)

DOI:10.63317/29zd2jx7tk75

Abstract

In this paper we describe the criteria adopted for the creation of a corpus of spoken language produced by children of six to eleven years of age in different communicative situations, the methodology used for the collection of data, the transcription, coding and lemmatization phases. We also give some quantitative descriptions about nouns, verbs and adjectives present in the corpus. Qualitative analyses on the adjectives are underway. This work is to be included among the activities carried out within the framework of the "Corpus di Linguaggio Infantile" (C.L.I.), a special project of the Italian National Research Council (CNR).

Details

Paper ID
lrec2002-main-192
Pages
N/A
BibKey
pecchia-etal-2002-linguistic
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
N/A
Conference
Third International Conference on Language Resources and Evaluation
Location
Las Palmas, Spain
Date
29 May 2002 31 May 2002

Authors

  • LP

    Laura Pecchia

  • GC

    Giuseppe Cappelli

  • EG

    Elisabetta Guazzini

Links