Whisper-SV: Adapting Whisper for low-data-resource speaker verification

L Zhang, N Jiang, Q Wang, Y Li, Q Lu, L Xie - Speech Communication, 2024 - Elsevier
Trained on 680,000 h of massive speech data, Whisper is a multitasking, multilingual
speech foundation model demonstrating superior performance in automatic speech …

[PDF][PDF] Teager Energy Cepstral Coefficients for Spoken Language Identification

AJ Shah, SH Yadav, HA Patil - apsipa2024.org
Spoken Language Identification (SLID) is a key component in audio processing that
facilitates the recognition and understanding of audio clips of spoken languages. Various …