MSRANet: Learning discriminative embeddings for speaker verification via channel and spatial attention mechanism in alterable scenarios

Q Zheng, Z Chen, H Liu, Y Lu, J Li, T Liu - Expert Systems with Applications, 2023 - Elsevier
Speaker embeddings have become the most popular feature representation in speaker
verification. Improving the robustness of speaker embedding extraction systems is a crucial …

Short-segment speaker verification using ecapa-tdnn with multi-resolution encoder

S Han, Y Ahn, K Kang, JW Shin - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org
Time-domain approaches have shown the potential to improve the performance of speaker
verification, but still predominant approaches utilize hand-crafted features such as the mel …

Cam: Context-aware masking for robust speaker verification

YQ Yu, S Zheng, H Suo, Y Lei… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Performance degradation caused by noise has been a long-standing challenge for speaker
verification. Previous methods usually involve applying a denoising transformation to …

Effective preservation of higher-frequency contents in the context of short utterance based children's speaker verification system

S Aziz, S Shahnawazuddin - Applied Acoustics, 2023 - Elsevier
Developing an automatic speaker verification (ASV) system for children is extremely
challenging due to the unavailability of children's speech corpora. The challenges are …

A comparison of CQT spectrogram with STFT-based acoustic features in Deep Learning-based synthetic speech detection

P Abdzadeh, H Veisi - Journal of AI and Data Mining, 2023 - jad.shahroodut.ac.ir
Automatic Speaker Verification (ASV) systems have proven to bevulnerable to various types
of presentation attacks, among whichLogical Access attacks are manufactured using …

Dynamic noise embedding: Noise aware training and adaptation for speech enhancement

J Lee, Y Jung, M Jung, H Kim - 2020 Asia-Pacific Signal and …, 2020 - ieeexplore.ieee.org
Estimating noise information exactly is crucial for noise aware training in speech
applications including speech enhancement (SE) which is our focus in this paper. To …

RSKNet-MTSP: Effective and portable deep architecture for speaker verification

Y Wu, C Guo, J Zhao, X Jin, J Xu - Neurocomputing, 2022 - Elsevier
The convolutional neural network (CNN) based approaches have shown great success for
speaker verification (SV) tasks, where modeling long temporal context and reducing …

A fused speech enhancement framework for robust speaker verification

Y Wu, T Li, J Zhao, Q Wang, J Xu - IEEE Signal Processing …, 2023 - ieeexplore.ieee.org
Robust speaker verification (RSV) under noisy conditions is still a challenging task.
Recently, some task-specific speech enhancement (SE) approaches are proposed and …

ASTT: acoustic spatial-temporal transformer for short utterance speaker recognition

X Wu, R Li, B Deng, M Zhao, X Du, J Wang… - Multimedia Tools and …, 2023 - Springer
Abstract Text-independent Short Utterance Speaker Recognition (SUSR) is of importance for
the purpose of person authentication. However, it is a great challenge for the speaker …

FA-ExU-Net: the simultaneous training of an embedding extractor and enhancement model for a speaker verification system robust to short noisy utterances

J Kim, J Heo, H Shin, C Lim… - IEEE/ACM Transactions on …, 2024 - ieeexplore.ieee.org
Speaker verification (SV) technology has the potential to enhance personalization and
security in various applications, such as voice assistants, forensics, and access control …