The voices from a distance challenge 2019 evaluation plan

Z Bai, XL Zhang - Neural Networks, 2021 - Elsevier

Speaker recognition is a task of identifying persons from their voices. Recently, deep
learning has dramatically revolutionized speaker recognition. However, there is lack of …

被引用次数：439 相关文章所有 9 个版本

[PDF] arxiv.org

Overview of speaker modeling and its applications: From the lens of deep speaker representation learning

S Wang, Z Chen, KA Lee, Y Qian… - IEEE/ACM Transactions …, 2024 - ieeexplore.ieee.org

Speaker individuality information is among the most critical elements within speech signals.
By thoroughly and accurately modeling this information, it can be utilized in various …

被引用次数：4 相关文章所有 4 个版本

[PDF] github.io

Wavoice: A noise-resistant multi-modal speech recognition system fusing mmwave and audio signals

T Liu, M Gao, F Lin, C Wang, Z Ba, J Han… - Proceedings of the 19th …, 2021 - dl.acm.org

With the advance in automatic speech recognition, voice user interface has gained
popularity recently. Since the COVID-19 pandemic, VUI is increasingly preferred in online …

被引用次数：84 相关文章所有 4 个版本

[PDF] arxiv.org

Voxsrc 2020: The second voxceleb speaker recognition challenge

A Nagrani, JS Chung, J Huh, A Brown, E Coto… - arXiv preprint arXiv …, 2020 - arxiv.org

We held the second installment of the VoxCeleb Speaker Recognition Challenge in
conjunction with Interspeech 2020. The goal of this challenge was to assess how well …

被引用次数：92 相关文章所有 2 个版本

[PDF] arxiv.org

Voxsrc 2021: The third voxceleb speaker recognition challenge

A Brown, J Huh, JS Chung, A Nagrani… - arXiv preprint arXiv …, 2022 - arxiv.org

The third instalment of the VoxCeleb Speaker Recognition Challenge was held in
conjunction with Interspeech 2021. The aim of this challenge was to assess how well current …

被引用次数：57 相关文章所有 2 个版本

[PDF] arxiv.org

Deep speaker embeddings for far-field speaker recognition on short utterances

A Gusev, V Volokhov, T Andzhukaev… - arXiv preprint arXiv …, 2020 - arxiv.org

Speaker recognition systems based on deep speaker embeddings have achieved
significant performance in controlled conditions according to the results obtained for early …

被引用次数：59 相关文章所有 9 个版本

[PDF] arxiv.org

NPLDA: A deep neural PLDA model for speaker verification

S Ramoji, P Krishnan, S Ganapathy - arXiv preprint arXiv:2002.03562, 2020 - arxiv.org

The state-of-art approach for speaker verification consists of a neural network based
embedding extractor along with a backend generative model such as the Probabilistic …

被引用次数：44 相关文章所有 7 个版本

[PDF] danielpovey.com

[PDF][PDF] The JHU Speaker Recognition System for the VOiCES 2019 Challenge.

D Snyder, J Villalba, N Chen, D Povey, G Sell… - …, 2019 - danielpovey.com

This paper describes the systems developed by the JHU team for the speaker recognition
track of the 2019 VOiCES from a Distance Challenge. On this far-field task, we achieved …

被引用次数：47 相关文章所有 11 个版本

[PDF] arxiv.org

Feature enhancement with deep feature losses for speaker verification

S Kataria, PS Nidadavolu, J Villalba… - ICASSP 2020-2020 …, 2020 - ieeexplore.ieee.org

Speaker Verification still suffers from the challenge of generalization to novel adverse
environments. We leverage on the recent advancements made by deep learning based …

被引用次数：39 相关文章所有 6 个版本

[PDF] isca-archive.org

[PDF][PDF] VoxTube: a multilingual speaker recognition dataset

I Yakovlev, A Okhotnikov, N Torgashov… - Proc …, 2023 - isca-archive.org

The objective of this paper is to advance the development of technologies in the fields of
speaker recognition and speaker identification by introducing a large labeled audio …

被引用次数：13 相关文章所有 3 个版本

高级搜索

QQ 群