A review of deep learning techniques for speech processing

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier
The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is a lack of …

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org
Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

The ins and outs of speaker recognition: lessons from VoxSRC 2020

Y Kwon, HS Heo, BJ Lee… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
The VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020 offers a
challenging evaluation for speaker recognition systems, which includes celebrities playing …

Deep speaker recognition: Process, progress, and challenges

AQ Ohi, MF Mridha, MA Hamid, MM Monowar - IEEE Access, 2021 - ieeexplore.ieee.org
Speaker recognition is related to human biometrics dealing with the identification of
speakers from their speech. Speaker recognition is an active research area and being …

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Improved meta learning for low resource speech recognition

S Singh, R Wang, F Hou - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
We propose a new meta learning based framework for low resource speech recognition that
improves the previous model agnostic meta learning (MAML) approach. The MAML is a …

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

J Kim, H Shim, J Heo, HJ Yu - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Despite achieving satisfactory performance in speaker verification using deep neural
networks, variable-duration utterances remain a challenge that threatens the robustness of …

End-to-end speaker verification via curriculum bipartite ranking weighted binary cross-entropy

Z Bai, J Wang, XL Zhang, J Chen - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org
End-to-end speaker verification achieves the verification through estimating directly the
similarity score between a pair of utterances, which is formulated as a binary (i.e., target …

Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization

D Wang, J Yu, X Wu, L Sun, X Liu… - 2021 12th International …, 2021 - ieeexplore.ieee.org
Dysarthric speech recognition is a challenging task as dysarthric data is limited and its
acoustics deviate significantly from normal speech. Model-based speaker adaptation is a …