Joint domain adaptation and speech bandwidth extension using time-domain GANs for speaker...

R Yoneyama, R Yamamoto… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

Neural audio super-resolution models are typically trained on low-and high-resolution audio
signal pairs. Although these methods achieve highly accurate super-resolution if the …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

Time-domain speech super-resolution with gan based modeling for telephony speaker verification

S Kataria, J Villalba, L Moro-Velázquez… - … on Audio, Speech …, 2024 - ieeexplore.ieee.org

Automatic Speaker Verification (ASV) technology has become commonplace in virtual
assistants. However, its performance suffers when there is a mismatch between the train and …

被引用次数：4 相关文章所有 4 个版本

[PDF] arxiv.org

Self-FiLM: Conditioning GANs with self-supervised representations for bandwidth extension based speaker recognition

S Kataria, J Villalba, L Moro-Velázquez… - arXiv preprint arXiv …, 2023 - arxiv.org

Speech super-resolution/Bandwidth Extension (BWE) can improve downstream tasks like
Automatic Speaker Verification (ASV). We introduce a simple novel technique called Self …

被引用次数：1 相关文章所有 5 个版本

CI-Mix: cut instance mix for robust speaker verification

Y Duan, Y Long, Y Li - International Journal of Speech Technology, 2023 - Springer

Data augmentation is commonly used to help build a robust speaker verification system,
especially when resources are limited. In this paper, we generalize the idea of CutMix to cut …

Robust Speaker Recognition using Perceptual and Adversarial Speech Enhancement

S Kataria - 2023 - jscholarship.library.jhu.edu

Abstract In Automatic Speaker Verification (ASV), we determine whether the speaker in the
test utterance is identical to the previously enrolled speaker. Deep learning has significantly …

被引用次数：1 相关文章所有 2 个版本

高级搜索

QQ 群