Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

An overview of deep-learning-based audio-visual speech enhancement and separation

D Michelsanti, ZH Tan, SX Zhang, Y Xu… - … on Audio, Speech …, 2021 - ieeexplore.ieee.org
Speech enhancement and speech separation are two related tasks, whose purpose is to
extract either one or more target speech signals, respectively, from a mixture of sounds …

[HTML][HTML] Self-attentive speaker embeddings for text-independent speaker verification.

Y Zhu, T Ko, D Snyder, B Mak, D Povey - Interspeech, 2018 - pianshen.com
摘要This paper introduces a new method to extract speaker embed-dings from a deep
neural network (DNN) for text-independent speaker verification. Usually, speaker …

X-vectors: Robust dnn embeddings for speaker recognition

D Snyder, D Garcia-Romero, G Sell… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org
In this paper, we use data augmentation to improve performance of deep neural network
(DNN) embeddings for speaker recognition. The DNN, which is trained to discriminate …

Generalized end-to-end loss for speaker verification

L Wan, Q Wang, A Papir… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
In this paper, we propose a new loss function called generalized end-to-end (GE2E) loss,
which makes the training of speaker verification models more efficient than our previous …

Biometrics recognition using deep learning: A survey

S Minaee, A Abdolrashidi, H Su, M Bennamoun… - Artificial Intelligence …, 2023 - Springer
In the past few years, deep learning-based models have been very successful in achieving
state-of-the-art results in many tasks in computer vision, speech recognition, and natural …

Cn-celeb: multi-genre speaker recognition

L Li, R Liu, J Kang, Y Fan, H Cui, Y Cai, R Vipperla… - Speech …, 2022 - Elsevier
Research on speaker recognition is extending to address the vulnerability in the wild
conditions, among which genre mismatch is perhaps the most challenging, for instance …

[PDF][PDF] End-to-end text-independent speaker verification with triplet loss on short utterances.

C Zhang, K Koishida - Interspeech, 2017 - isca-archive.org
Text-independent speaker verification against short utterances is still challenging despite of
recent advances in the field of speaker recognition with i-vector framework. In general, to get …

Fooling end-to-end speaker verification with adversarial examples

F Kreuk, Y Adi, M Cisse, J Keshet - 2018 IEEE international …, 2018 - ieeexplore.ieee.org
Automatic speaker verification systems are increasingly used as the primary means to
authenticate costumers. Recently, it has been proposed to train speaker verification systems …

Speech processing for digital home assistants: Combining signal processing with deep-learning techniques

R Haeb-Umbach, S Watanabe… - IEEE Signal …, 2019 - ieeexplore.ieee.org
Once a popular theme of futuristic science fiction or far-fetched technology forecasts, digital
home assistants with a spoken language interface have become a ubiquitous commodity …