Back to Main Conference 2012
LREC 2012main

Strategies to Improve a Speaker Diarisation Tool

Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC 2012)

DOI:10.63317/27n86p56irgs

Abstract

This paper describes the different strategies used to improve the results obtained by an off-line speaker diarisation tool with the Albayzin 2010 diarisation database. The errors made by the system have been analyzed and different strategies have been proposed to reduce each kind of error. Very short segments incorrectly labelled and different appearances of one speaker labelled with different identifiers are the most common errors. A post-processing module that refines the segmentation by retraining the GMM models of the speakers involved has been built to cope with these errors. This post-processing module has been tuned with the training dataset and improves the result of the diarisation system by 16.4% in the test dataset.

Details

Paper ID
lrec2012-main-413
Pages
pp. 4117-4121
BibKey
tavarez-etal-2012-strategies
Editor
N/A
Publisher
European Language Resources Association (ELRA)
ISSN
2522-2686
ISBN
978-2-9517408-7-7
Conference
Eighth International Conference on Language Resources and Evaluation
Location
Istanbul, Turkey
Date
21 May 2012 27 May 2012

Authors

  • DT

    David Tavarez

  • EN

    Eva Navas

  • DE

    Daniel Erro

  • IS

    Ibon Saratxaga

Links