Title

SLR Validation: Current Trends and Developments

Author(s)

Henk van den Heuvel, Dorota Iskra, Eric Sanders, Folkert de Vriend

SPEX (Speech Processing Expertise Centre), Department of Language and Speech, Nijmegen, the Netherlands

Session

P9-SE

Abstract

This paper deals with the quality evaluation (validation) of Spoken Language Resources (SLR). The current situation in terms of relevant validation criteria and procedures is briefly presented. Next, a number of validation issues related to new data formats (XML-based annotations, UTF-16 encoding) are discussed. Further, new validation cycles introduced in a series of new projects such as SpeeCon and OrienTel are addressed: prompt sheet validation, lexicon validation, and pre-release validation. Finally, SPEX's current and future

Keyword(s)

speech databases, validation, XML

Language(s)

Multiple

Full Paper

328.pdf