DECODA: a call-centre human-human spoken conversation corpus

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/4bmi4j984qtv

Abstract

The goal of the DECODA project is to reduce the development cost of Speech Analytics systems by reducing the need for manual annotation. This project aims to propose robust speech data mining tools in the framework of call-center monitoring and evaluation, by means of weakly supervised methods. The applicative framework of the project is the call-center of the RATP (Paris public transport authority). This project tackles two very important open issues in the development of speech mining methods from spontaneous speech recorded in call-centers: robustness (how to extract relevant information from very noisy and spontaneous speech messages) and weak supervision (how to reduce the annotation effort needed to train and adapt recognition and classification models). This paper describes the DECODA corpus collected at the RATP during the project. We present the different annotation levels performed on the corpus, the methods used to obtain them, as well as some evaluation of the quality of the annotations produced.

Details

Paper ID
lrec2012-main-399
Pages
pp. 1343-1347
BibKey
bechet-etal-2012-decoda
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21-27 May 2012

Authors

  • Frederic Bechet
  • Benjamin Maza
  • Nicolas Bigouroux
  • Thierry Bazillon
  • Marc El-Bèze
  • Renato De Mori
  • Eric Arbillot
