The construction of speaker-specific acoustic models for automatic speaker recognition is almost exclusively based on deep neural network-based speaker embeddings. This work …
Initially developed for natural language processing (NLP), Transformer model is now widely used for speech processing tasks such as speaker recognition, due to its powerful sequence …
Transformer models have demonstrated superior performance across various domains, including computer vision, natural language processing, and speech recognition. The …
R Yan, C Wen, S Zhou, T Guo… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
This paper describes our best system and methodology for ADD 2022: The First Audio Deep Synthesis Detection Challenge [1]. The very same system was used for both two rounds of …
Currently, ECAPA-TDNN is one of the state-of-the-art deep models for automatic speaker verification (ASV). However, it focuses too much on local feature extraction with fixed local …
Automated speaker recognition is enabling personalized interactions with the voice-based interfaces and assistants part of the modern cyber-physical-social systems. Prior studies …
Recently, speaker verification systems using deep neural networks have been widely studied. Many of them utilize hand-crafted features such as mel-filterbank energies, mel …
F Xie, D Zhang, C Liu - Applied Sciences, 2022 - mdpi.com
Transformer models are now widely used for speech processing tasks due to their powerful sequence modeling capabilities. Previous work determined an efficient way to model …
In this paper, we propose dictionary attacks against speaker verification-a novel attack vector that aims to match a large fraction of speaker population by chance. We introduce a …