Meta-learning for short utterance speaker recognition with imbalance length pairs

A Mehrish, N Majumder, R Bharadwaj, R Mihalcea… - Information …, 2023 - Elsevier

The field of speech processing has undergone a transformative shift with the advent of deep
learning. The use of multiple processing layers has enabled the creation of models capable …

被引用次数：223 相关文章所有 6 个版本

[PDF] arxiv.org

Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

被引用次数：438 相关文章所有 9 个版本

[PDF] ieee.org

A survey of speaker recognition: Fundamental theories, recognition methods and opportunities

MM Kabir, MF Mridha, J Shin, I Jahan, AQ Ohi - IEEE Access, 2021 - ieeexplore.ieee.org

Humans can identify a speaker by listening to their voice, over the telephone, or on any
digital devices. Acquiring this congenital human competency, authentication technologies …

被引用次数：125 相关文章所有 4 个版本

[PDF] arxiv.org

The ins and outs of speaker recognition: lessons from VoxSRC 2020

Y Kwon, HS Heo, BJ Lee… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org

The VoxCeleb Speaker Recognition Challenge (VoxSRC) at Interspeech 2020 offers a
challenging evaluation for speaker recognition systems, which includes celebrities playing …

被引用次数：72 相关文章所有 5 个版本

[PDF] ieee.org

Deep speaker recognition: Process, progress, and challenges

AQ Ohi, MF Mridha, MA Hamid, MM Monowar - IEEE Access, 2021 - ieeexplore.ieee.org

Speaker recognition is related to human biometrics dealing with the identification of
speakers from their speech. Speaker recognition is an active research area and being …

被引用次数：49 相关文章所有 5 个版本

[PDF] arxiv.org

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org

Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

Improved meta learning for low resource speech recognition

S Singh, R Wang, F Hou - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

We propose a new meta learning based framework for low resource speech recognition that
improves the previous model agnostic meta learning (MAML) approach. The MAML is a …

被引用次数：28 相关文章所有 3 个版本

[PDF] arxiv.org

RawNeXt: Speaker verification system for variable-duration utterances with deep layer aggregation and extended dynamic scaling policies

J Kim, H Shim, J Heo, HJ Yu - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Despite achieving satisfactory performance in speaker verification using deep neural
networks, variable-duration utterances remain a challenge that threatens the robustness of …

被引用次数：30 相关文章所有 5 个版本

[PDF] xiaolei-zhang.net

End-to-end speaker verification via curriculum bipartite ranking weighted binary cross-entropy

Z Bai, J Wang, XL Zhang, J Chen - IEEE/ACM Transactions on …, 2022 - ieeexplore.ieee.org

End-to-end speaker verification achieves the verification through estimating directly the
similarity score between a pair of utterances, which is formulated as a binary (ie, target …

被引用次数：27 相关文章所有 3 个版本

[PDF] arxiv.org

Improved end-to-end dysarthric speech recognition via meta-learning based model re-initialization

D Wang, J Yu, X Wu, L Sun, X Liu… - 2021 12th International …, 2021 - ieeexplore.ieee.org

Dysarthric speech recognition is a challenging task as dysarthric data is limited and its
acoustics deviate significantly from normal speech. Model-based speaker adaptation is a …

被引用次数：49 相关文章所有 4 个版本

高级搜索

QQ 群