Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's...

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

被引用次数：25 相关文章所有 4 个版本

[PDF] ieee.org

Unsupervised representation disentanglement using cross domain features and adversarial learning in variational autoencoder based voice conversion

WC Huang, H Luo, HT Hwang, CC Lo… - … on Emerging Topics …, 2020 - ieeexplore.ieee.org

An effective approach for voice conversion (VC) is to disentangle linguistic content from
other components in the speech signal. The effectiveness of variational autoencoder (VAE) …

被引用次数：50 相关文章所有 7 个版本

[PDF] acm.org

Velody: Nonlinear vibration challenge-response for resilient user authentication

J Li, K Fawaz, Y Kim - Proceedings of the 2019 ACM SIGSAC …, 2019 - dl.acm.org

Biometrics have been widely adopted for enhancing user authentication, benefiting usability
by exploiting pervasive and collectible unique characteristics from physiological or …

被引用次数：51 相关文章所有 5 个版本

[PDF] researchgate.net

Assessing the scope of generalized countermeasures for anti-spoofing

RK Das, J Yang, H Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org

Most of the research on anti-spoofing countermeasures are specific to a type of spoofing
attacks, where models are trained on data of a particular nature, either synthetic or replay …

被引用次数：46 相关文章所有 3 个版本

[PDF] isca-archive.org

[PDF][PDF] Attention-Based Convolutional Neural Network for ASV Spoofing Detection.

H Ling, L Huang, J Huang, B Zhang, P Li - Interspeech, 2021 - isca-archive.org

In recent years, automatic speaker verification (ASV) algorithms have undergone significant
progress. They have been widely deployed in different applications, but the ASV systems …

被引用次数：24 相关文章所有 4 个版本

[PDF] arxiv.org

Data augmentation with signal companding for detection of logical access attacks

RK Das, J Yang, H Li - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org

The recent advances in voice conversion (VC) and text-to-speech (TTS) make it possible to
produce natural sounding speech that poses threat to automatic speaker verification (ASV) …

被引用次数：31 相关文章所有 4 个版本

[PDF] ieee.org

Audio Deepfake Approaches

OA Shaaban, R Yildirim, AA Alguttar - IEEE Access, 2023 - ieeexplore.ieee.org

This paper presents a review of techniques involved in the creation and detection of audio
deepfakes, the first section provides information about general deep fakes. In the second …

被引用次数：9 相关文章所有 3 个版本

[PDF] arxiv.org

Voice mimicry attacks assisted by automatic speaker verification

V Vestman, T Kinnunen, RG Hautamäki… - Computer Speech & …, 2020 - Elsevier

In this work, we simulate a scenario, where a publicly available ASV system is used to
enhance mimicry attacks against another closed source ASV system. In specific, ASV …

被引用次数：45 相关文章所有 13 个版本

[PDF] a-star.edu.sg

On the study of generative adversarial networks for cross-lingual voice conversion

B Sisman, M Zhang, M Dong… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

Cross-lingual voice conversion (VC) aims to convert the source speaker's voice to sound like
that of the target speaker, when the source and target speakers speak different languages …

被引用次数：39 相关文章所有 4 个版本

[PDF] sciencedirect.com

Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

Y Yasuda, X Wang, J Yamagishi - Computer Speech & Language, 2021 - Elsevier

Neural sequence-to-sequence text-to-speech synthesis (TTS) can produce high-quality
speech directly from text or simple linguistic features such as phonemes. Unlike traditional …

被引用次数：33 相关文章所有 4 个版本

高级搜索

QQ 群