An effective approach for voice conversion (VC) is to disentangle linguistic content from other components in the speech signal. The effectiveness of variational autoencoder (VAE) …
J Li, K Fawaz, Y Kim - Proceedings of the 2019 ACM SIGSAC …, 2019 - dl.acm.org
Biometrics have been widely adopted for enhancing user authentication, benefiting usability by exploiting pervasive and collectible unique characteristics from physiological or …
RK Das, J Yang, H Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Most research on anti-spoofing countermeasures is specific to one type of spoofing attack, where models are trained on data of a particular nature, either synthetic or replay …
In recent years, automatic speaker verification (ASV) algorithms have undergone significant progress. They have been widely deployed in different applications, but the ASV systems …
RK Das, J Yang, H Li - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
The recent advances in voice conversion (VC) and text-to-speech (TTS) make it possible to produce natural-sounding speech that poses a threat to automatic speaker verification (ASV) …
This paper presents a review of techniques involved in the creation and detection of audio deepfakes. The first section provides information about deepfakes in general. In the second …
In this work, we simulate a scenario in which a publicly available ASV system is used to enhance mimicry attacks against another closed-source ASV system. Specifically, ASV …
Cross-lingual voice conversion (VC) aims to convert the source speaker's voice to sound like that of the target speaker, when the source and target speakers speak different languages …
Neural sequence-to-sequence text-to-speech synthesis (TTS) can produce high-quality speech directly from text or simple linguistic features such as phonemes. Unlike traditional …