Back to Main Conference 2010
LREC 2010main
A Morphologically-Analyzed CHILDES Corpus of Hebrew
Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC 2010)
Abstract
We present a corpus of transcribed spoken Hebrew that forms an integral part of a comprehensive data system that has been developed to suit the specific needs and interests of child language researchers: CHILDES (Child Language Data Exchange System). We introduce a dedicated transcription scheme for the spoken Hebrew data that is aware both of the phonology and of the standard orthography of the language. We also introduce a morphological analyzer that was specifically developed for this corpus.