HomeLREC 2026WorkshopsCHIPSALlrec2026-ws-chipsal-16
Back to CHIPSAL 2026
LREC 2026workshop

Reward-Guided Fine-Tuning of Whisper for Low-Resource Nepali Speech Recognition

Proceedings of the Second workshop on Challenges in Processing South Asian Languages (CHiPSAL2026)

DOI:10.63317/45u9hjmchyhh

Abstract

Fine tuning speech recognition models on noisy real world data is tricky. The model has no way of knowing which training samples are reliable and which are not, so it ends up learning from bad examples just as readily as good ones. This is a real problem for Nepali, where most available training data comes from YouTube videos with automatically generated subtitles that are often inaccurate. In this work, we tried a simple fix. Instead of feeding everything to the model, we first asked humans to rate the quality of a sample of transcriptions, trained a small Random Forest classifier on those 2,000 ratings, and used it to filter out the bad samples before each retraining round. The classifier uses four automatically computable features, Word Error Rate (WER), Character Error Rate (CER), length ratio, and length difference, and achieves 81% accuracy on a held out set. Running two filtering and retraining cycles on a 40,000 clip training subset drawn from a 68.4 hour corpus improves substantially over our own standard fine tuning baseline of 5.60% WER and 5.10% CER, reaching 4.89% WER and 4.52% CER, which corresponds to an 11 to 13% relative gain. The approach is much lighter than full Reinforcement Learning from Human Feedback but still uses real human judgment to guide training.

Details

Paper ID
lrec2026-ws-chipsal-16
Pages
pp. 169-175
BibKey
pandit-etal-2026-reward
Editors
Kengatharaiyer Sarveswaran, Ashwini Vaidya
Publisher
European Language Resources Association (ELRA)
ISSN
N/A
ISBN
N/A
Workshop
Proceedings of the Second workshop on Challenges in Processing South Asian Languages (CHiPSAL2026)
Location
Palma, Mallorca, Spain
Date
11 - 16 May 2026

Authors

  • AP

    Aadarsh Pandit

  • YK

    Yudhin Khanal

  • IP

    Ishan Pandey

  • KK

    Kushal Kunwar

  • SR

    Sunil Regmi

Links