Speaker recognition based on deep learning: An overview

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier
Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org
Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

Wavoice: A noise-resistant multi-modal speech recognition system fusing mmwave and audio signals

T Liu, M Gao, F Lin, C Wang, Z Ba, J Han… - Proceedings of the 19th …, 2021 - dl.acm.org
With the advance in automatic speech recognition, voice user interface has gained
popularity recently. Since the COVID-19 pandemic, VUI is increasingly preferred in online …

Voxsrc 2020: The second voxceleb speaker recognition challenge

A Nagrani, JS Chung, J Huh, A Brown, E Coto… - arXiv preprint arXiv …, 2020 - arxiv.org
We held the second installment of the VoxCeleb Speaker Recognition Challenge in
conjunction with Interspeech 2020. The goal of this challenge was to assess how well …

Voxsrc 2021: The third voxceleb speaker recognition challenge

A Brown, J Huh, JS Chung, A Nagrani… - arXiv preprint arXiv …, 2022 - arxiv.org
The third instalment of the VoxCeleb Speaker Recognition Challenge was held in
conjunction with Interspeech 2021. The aim of this challenge was to assess how well current …

Deep speaker embeddings for far-field speaker recognition on short utterances

A Gusev, V Volokhov, T Andzhukaev… - arXiv preprint arXiv …, 2020 - arxiv.org
Speaker recognition systems based on deep speaker embeddings have achieved
significant performance in controlled conditions according to the results obtained for early …

NPLDA: A deep neural PLDA model for speaker verification

S Ramoji, P Krishnan, S Ganapathy - arXiv preprint arXiv:2002.03562, 2020 - arxiv.org
The state-of-art approach for speaker verification consists of a neural network based
embedding extractor along with a backend generative model such as the Probabilistic …

[PDF][PDF] The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

D Snyder, J Villalba, N Chen, D Povey, G Sell… - …, 2019 - danielpovey.com
This paper describes the systems developed by the JHU team for the speaker recognition
track of the 2019 VOiCES from a Distance Challenge. On this far-field task, we achieved …

Feature enhancement with deep feature losses for speaker verification

S Kataria, PS Nidadavolu, J Villalba… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org
Speaker Verification still suffers from the challenge of generalization to novel adverse
environments. We leverage on the recent advancements made by deep learning based …

[PDF][PDF] VoxTube: a multilingual speaker recognition dataset

I Yakovlev, A Okhotnikov, N Torgashov… - Proc …, 2023 - isca-archive.org
The objective of this paper is to advance the development of technologies in the fields of
speaker recognition and speaker identification by introducing a large labeled audio …