LREC-COLING 2024 (main)

CB-Whisper: Contextual Biasing Whisper Using Open-Vocabulary Keyword-Spotting

Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)

DOI:10.63317/3z6frdixwnmr

Abstract

End-to-end automatic speech recognition (ASR) systems often struggle to recognize rare named entities, such as personal names, organizations, and terminology, that are not frequently encountered in the training data. This paper presents Contextual Biasing Whisper (CB-Whisper), a novel ASR system based on OpenAI’s Whisper model that can recognize user-defined named entities by performing open-vocabulary keyword-spotting (KWS) before the decoder. The KWS module leverages text-to-speech (TTS) techniques and a convolutional neural network (CNN) classifier to match the features between the entities and the utterances. To integrate the recognized entities into the Whisper decoder and avoid hallucinations, we carefully crafted multiple prompts with spoken form hints. Experiments show that the KWS module based on the Whisper encoder’s features can recognize unseen user-defined keywords effectively. More importantly, the proposed CB-Whisper substantially improves the mixed-error-rate (MER) and entity recall compared to the original Whisper model on three internal datasets and two publicly available datasets (Aishell and ACL), covering English-only, Chinese-only, and code-switching scenarios.
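The feature-matching idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the frame vectors below are toy stand-ins for Whisper encoder features, the function names are hypothetical, and the diagonal-averaging score is a simple assumed substitute for the CNN classifier, whose architecture the abstract does not describe. The sketch compares every frame of a synthesized keyword with every frame of an utterance and checks whether the keyword appears as a run of consecutive matching frames.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def similarity_matrix(keyword_frames, utterance_frames):
    """Rows index keyword frames, columns index utterance frames."""
    return [[cosine(k, u) for u in utterance_frames] for k in keyword_frames]


def best_diagonal_score(sim):
    """Assumed stand-in for the CNN classifier: the highest mean similarity
    along any diagonal that spans the whole keyword. Returns -1.0 when the
    utterance is shorter than the keyword or the matrix is empty."""
    if not sim or not sim[0]:
        return -1.0
    n_kw, n_utt = len(sim), len(sim[0])
    best = -1.0
    for start in range(n_utt - n_kw + 1):
        score = sum(sim[i][start + i] for i in range(n_kw)) / n_kw
        best = max(best, score)
    return best


if __name__ == "__main__":
    # Toy 2-dimensional "features"; real encoder features are far larger.
    keyword = [[1.0, 0.0], [0.0, 1.0]]
    utterance = [[0.0, 1.0], [1.0, 0.0], [0.0, 1.0]]
    sim = similarity_matrix(keyword, utterance)
    print("keyword match score:", best_diagonal_score(sim))
```

A keyword whose score exceeds a chosen threshold would then be passed to the decoder as a prompt hint; the threshold and the prompt wording are left open here because the abstract does not specify them.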

Details

Paper ID
lrec2024-main-0262
Pages
pp. 2941-2946
BibKey
li-etal-2024-cb
Editor
N/A
Publisher
European Language Resources Association (ELRA) and ICCL
ISSN
2522-2686
ISBN
979-10-95546-34-4
Conference
Joint International Conference on Computational Linguistics, Language Resources and Evaluation
Location
Turin, Italy
Date
20 May 2024 to 25 May 2024

Authors

  • Yuang Li
  • Yinglu Li
  • Min Zhang
  • Chang Su
  • Jiawei Yu
  • Mengyao Piao
  • Xiaosong Qiao
  • Miaomiao Ma
  • Yanqing Zhao
  • Hao Yang

Links