Back to Main Conference 2024
LREC-COLING 2024main

RoboVox: A Single/Multi-channel Far-field Speaker Recognition Benchmark for a Mobile Robot

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/4imc9rcapmge

Abstract

In this paper, we introduce a new far-field speaker recognition benchmark called RoboVox. RoboVox is a French corpus recorded by a mobile robot. The files are recorded from different distances under severe acoustical conditions with the presence of several types of noise and reverberation. In addition to noise and reverberation, the robot’s internal noise acts as an extra additive noise. RoboVox can be used for both single-channel and multi-channel speaker recognition. In the evaluation protocols, we are considering both cases. The obtained results demonstrate a significant decline in performance in far-filed speaker recognition and urge the community to further research in this domain

Details

Paper ID
lrec2024-main-1234
Pages
pp. 14152-14156
BibKey
mohammadamini-etal-2024-robovox
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 25 May 2024

Authors

  • MM

    Mohammad Mohammadamini

  • DM

    Driss Matrouf

  • MR

    Michael Rouvier

  • JB

    Jean-Francois Bonastre

  • RS

    Romain Serizel

  • TG

    Theophile Gonos

Links