Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

Unsupervised representation disentanglement using cross domain features and adversarial learning in variational autoencoder based voice conversion

WC Huang, H Luo, HT Hwang, CC Lo… - … on Emerging Topics …, 2020 - ieeexplore.ieee.org
An effective approach for voice conversion (VC) is to disentangle linguistic content from
other components in the speech signal. The effectiveness of variational autoencoder (VAE) …

Velody: Nonlinear vibration challenge-response for resilient user authentication

J Li, K Fawaz, Y Kim - Proceedings of the 2019 ACM SIGSAC …, 2019 - dl.acm.org
Biometrics have been widely adopted for enhancing user authentication, benefiting usability
by exploiting pervasive and collectible unique characteristics from physiological or …

Assessing the scope of generalized countermeasures for anti-spoofing

RK Das, J Yang, H Li - ICASSP 2020-2020 IEEE International …, 2020 - ieeexplore.ieee.org
Most of the research on anti-spoofing countermeasures are specific to a type of spoofing
attacks, where models are trained on data of a particular nature, either synthetic or replay …

[PDF][PDF] Attention-Based Convolutional Neural Network for ASV Spoofing Detection.

H Ling, L Huang, J Huang, B Zhang, P Li - Interspeech, 2021 - isca-archive.org
In recent years, automatic speaker verification (ASV) algorithms have undergone significant
progress. They have been widely deployed in different applications, but the ASV systems …

Data augmentation with signal companding for detection of logical access attacks

RK Das, J Yang, H Li - ICASSP 2021-2021 IEEE International …, 2021 - ieeexplore.ieee.org
The recent advances in voice conversion (VC) and text-to-speech (TTS) make it possible to
produce natural sounding speech that poses threat to automatic speaker verification (ASV) …

Audio Deepfake Approaches

OA Shaaban, R Yildirim, AA Alguttar - IEEE Access, 2023 - ieeexplore.ieee.org
This paper presents a review of techniques involved in the creation and detection of audio
deepfakes, the first section provides information about general deep fakes. In the second …

Voice mimicry attacks assisted by automatic speaker verification

V Vestman, T Kinnunen, RG Hautamäki… - Computer Speech & …, 2020 - Elsevier
In this work, we simulate a scenario, where a publicly available ASV system is used to
enhance mimicry attacks against another closed source ASV system. In specific, ASV …

On the study of generative adversarial networks for cross-lingual voice conversion

B Sisman, M Zhang, M Dong… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
Cross-lingual voice conversion (VC) aims to convert the source speaker's voice to sound like
that of the target speaker, when the source and target speakers speak different languages …

Investigation of learning abilities on linguistic features in sequence-to-sequence text-to-speech synthesis

Y Yasuda, X Wang, J Yamagishi - Computer Speech & Language, 2021 - Elsevier
Neural sequence-to-sequence text-to-speech synthesis (TTS) can produce high-quality
speech directly from text or simple linguistic features such as phonemes. Unlike traditional …