Nvc-net: End-to-end adversarial voice conversion

B Nguyen, F Cardinaux - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Voice conversion (VC) has gained increasing popularity in many speech synthesis
applications. The idea is to change the voice identity from one speaker into another while …

Joint training framework for text-to-speech and voice conversion using multi-source tacotron and wavenet

M Zhang, X Wang, F Fang, H Li, J Yamagishi - arXiv preprint arXiv …, 2019 - arxiv.org
We investigated the training of a shared model for both text-to-speech (TTS) and voice
conversion (VC) tasks. We propose using an extended model architecture of Tacotron, that …

Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data

J Lorenzo-Trueba, F Fang, X Wang, I Echizen… - arXiv preprint arXiv …, 2018 - arxiv.org
Thanks to the growing availability of spoofing databases and rapid advances in using them,
systems for detecting voice spoofing attacks are becoming more and more capable, and …

Seismic impedance inversion using conditional generative adversarial network

D Meng, B Wu, Z Wang, Z Zhu - IEEE Geoscience and Remote …, 2021 - ieeexplore.ieee.org
Deep-learning methods, such as convolutional neural networks (CNNs), have been
successfully applied to seismic impedance inversion in recent years. Compared with …

Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org
Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

Catch You and I Can: Revealing source voiceprint against voice conversion

J Deng, Y Chen, Y Zhong, Q Miao, X Gong… - 32nd USENIX Security …, 2023 - usenix.org
Voice conversion (VC) techniques can be abused by malicious parties to transform their
audios to sound like a target speaker, making it hard for a human being or a speaker …

SINGAN: Singing voice conversion with generative adversarial networks

B Sisman, K Vijayan, M Dong… - 2019 Asia-Pacific Signal …, 2019 - ieeexplore.ieee.org
Singing voice conversion (SVC) is a task to convert the source singer's voice to sound like
that of the target singer, without changing the lyrical content. So far, most of the voice …

Towards General-Purpose Text-Instruction-Guided Voice Conversion

CY Kuan, CA Li, TY Hsu, TY Lin… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
This paper introduces a novel voice conversion (VC) model, guided by text instructions such
as “articulate slowly with a deep tone “or “speak in a cheerful boyish voice”. Unlike …

[HTML][HTML] Speaker anonymization by modifying fundamental frequency and x-vector singular value

CO Mawalim, K Galajit, J Karnjana, S Kidani… - Computer Speech & …, 2022 - Elsevier
Speaker anonymization is a method of protecting voice privacy by concealing individual
speaker characteristics while preserving linguistic information. The VoicePrivacy Challenge …

Voice conversion based on cross-domain features using variational auto encoders

WC Huang, HT Hwang, YH Peng… - … on Chinese Spoken …, 2018 - ieeexplore.ieee.org
An effective approach to non-parallel voice conversion (VC) is to utilize deep neural
networks (DNNs), specifically variational auto encoders (VAEs), to model the latent structure …