A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org
Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

Speaker identification features extraction methods: A systematic review

SS Tirumala, SR Shahamiri, AS Garhwal… - Expert Systems with …, 2017 - Elsevier
Speaker Identification (SI) is the process of identifying the speaker from a given utterance by
comparing the voice biometrics of the utterance with those utterance models stored …

Speech enhancement with EMD and hurst-based mode selection

L Zão, R Coelho, P Flandrin - IEEE/ACM Transactions on Audio …, 2014 - ieeexplore.ieee.org
This paper presents a speech enhancement technique for signals corrupted by
nonstationary acoustic noises. The proposed approach applies the empirical mode …

Optimal and unbiased filtering with colored process noise using state differencing

YS Shmaliy, S Zhao, CK Ahn - IEEE Signal Processing Letters, 2019 - ieeexplore.ieee.org
This letter develops the Kalman and unbiased finite impulse response filtering algorithms for
linear discrete-time state-space models with Gauss-Markov colored process noise (CPN) …

Adaptive wavelet shrinkage for noise robust speaker recognition

SM Govindan, P Duraisamy, X Yuan - Digital Signal Processing, 2014 - Elsevier
Speaker recognition faces many practical difficulties, among which signal inconsistency due
to environmental and acquisition channel factors is most challenging. The noise imposed to …

On speech features fusion, α-integration Gaussian modeling and multi-style training for noise robust speaker classification

A Venturini, L Zao, R Coelho - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
This paper investigates the fusion of Mel-frequency cepstral coefficients (MFCC) and
statistical pH features to improve the performance of speaker verification (SV) in non …

[PDF][PDF] Speech enhancement using sliding window empirical mode decomposition and hurst-based technique

S Poovarasan, E Chandra - Archives of Acoustics, 2019 - bibliotekanauki.pl
The most challenging in speech enhancement technique is tracking non-stationary noises
for long speech segments and low Signal-to-Noise Ratio (SNR). Different speech …

A free synthetic corpus for speaker diarization research

E Edwards, M Brenndoerfer, A Robinson… - Speech and Computer …, 2018 - Springer
A synthetic corpus of dialogs was constructed from the LibriSpeech corpus, and is made
freely available for diarization research. It includes over 90 h of training data, and over 9 h …

FRS: Adaptive Score for Improving Acoustic Source Classification From Noisy Signals

R Marinati, R Coelho, L Zão - IEEE Signal Processing Letters, 2024 - ieeexplore.ieee.org
This letter introduces a Frame Relevance Score (FRS) to improve the classification of
environmental acoustic sources from noisy speech signals. The importance of each short …

Fusion multistyle training for speaker identification of disguised speech

S Prasad, R Prasad - Wireless Personal Communications, 2019 - Springer
Determining the speaker of a given speech utterance from a group of people is referred to as
speaker identification. When voice disguising is done by a person, which is commonly seen …