Leveraging study of robustness and portability of spoken language understanding systems across languages and domains: the PORTMEDIA corpora

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/47a7b39jc4gi

Abstract

The PORTMEDIA project is intended to develop new corpora for the evaluation of spoken language understanding systems. The newly collected data are in the field of human-machine dialogue systems for tourist information in French, in line with the MEDIA corpus. Transcriptions and semantic annotations, obtained by low-cost procedures, are provided to allow a thorough evaluation of the systems' capabilities in terms of robustness and portability across languages and domains. A new test set with some adaptation data is prepared for each case: in Italian as an example of a new language, and for ticket reservation as an example of a new domain. Finally, the work is complemented by the proposal of a new high-level semantic annotation scheme well suited to dialogue data.

Details

Paper ID
lrec2012-main-438
Pages
pp. 1436-1442
BibKey
lefevre-etal-2012-leveraging
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21-27 May 2012

Authors

  • Fabrice Lefèvre
  • Djamel Mostefa
  • Laurent Besacier
  • Yannick Estève
  • Matthieu Quignard
  • Nathalie Camelin
  • Benoit Favre
  • Bassam Jabaian
  • Lina M. Rojas-Barahona
