C Kim, RM Stern - IEEE/ACM Transactions on audio, speech …, 2016 - ieeexplore.ieee.org
This paper presents a new feature extraction algorithm called power normalized Cepstral coefficients (PNCC) that is motivated by auditory processing. Major new features of PNCC …
From the available biometric technologies, automatic speaker recognition is one of the most convenient and accessible ones due to abundance of mobile devices equipped with a …
A Jansen, B Van Durme - 2011 IEEE Workshop on Automatic …, 2011 - ieeexplore.ieee.org
Spoken term discovery is the task of automatically identifying words and phrases in speech data by searching for long repeated acoustic patterns. Initial solutions relied on exhaustive …
Mel-frequency cepstral coefficients (MFCC) have been dominantly used in speaker recognition as well as in speech recognition. However, based on theories in speech …
MA Carlin, S Thomas, A Jansen… - … Annual Conference of …, 2011 - academia.edu
Acoustic front-ends are typically developed for supervised learning tasks and are thus optimized to minimize word error rate, phone error rate, etc. However, in recent efforts to …
This paper examines the individual and combined impacts of various front-end approaches on the performance of deep neural network (DNN) based speech recognition systems in …
The language identification (LID) task in the Robust Automatic Transcription of Speech (RATS) program is challenging due to the noisy nature of the audio data collected over …
G Mantena, S Achanta… - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
The task of query-by-example spoken term detection (QbE-STD) is to find a spoken query within spoken audio data. Current state-of-the-art techniques assume zero prior knowledge …
A Purushothaman, D Dutta, R Kumar… - … /ACM Transactions on …, 2023 - ieeexplore.ieee.org
Speech applications in far-field real world settings often deal with signals that are corrupted by reverberation. The task of dereverberation constitutes an important step to improve the …