Back to Main Conference 2002
LREC 2002main
The TASX-environment: an XML-based toolset for time aligned speech corpora
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC 2002)
Abstract
This paper describes the design and implementation of an XML-based corpus environment for multi-tier annotated speech data. The TASX-environment (TASX: Time Aligned Signal data eXchange format) constitutes the technical basis for a corpus designed to explore the acquisition of prosody by second language learners. It supports all aspects of the corpus setup procedure: XML-based annotation of the speech data, all transformation of non XML-annotations, and the web-based analysis and dissemination of thedata.