High-quality nonparallel voice conversion based on cycle-consistent adversarial network

B Nguyen, F Cardinaux - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org

Voice conversion (VC) has gained increasing popularity in many speech synthesis
applications. The idea is to change the voice identity from one speaker into another while …

被引用次数：42 相关文章所有 4 个版本

[PDF] arxiv.org

Joint training framework for text-to-speech and voice conversion using multi-source tacotron and wavenet

M Zhang, X Wang, F Fang, H Li, J Yamagishi - arXiv preprint arXiv …, 2019 - arxiv.org

We investigated the training of a shared model for both text-to-speech (TTS) and voice
conversion (VC) tasks. We propose using an extended model architecture of Tacotron, that …

被引用次数：76 相关文章所有 7 个版本

[PDF] arxiv.org

Can we steal your vocal identity from the Internet?: Initial investigation of cloning Obama's voice using GAN, WaveNet and low-quality found data

J Lorenzo-Trueba, F Fang, X Wang, I Echizen… - arXiv preprint arXiv …, 2018 - arxiv.org

Thanks to the growing availability of spoofing databases and rapid advances in using them,
systems for detecting voice spoofing attacks are becoming more and more capable, and …

被引用次数：88 相关文章所有 11 个版本

[PDF] researchgate.net

Seismic impedance inversion using conditional generative adversarial network

D Meng, B Wu, Z Wang, Z Zhu - IEEE Geoscience and Remote …, 2021 - ieeexplore.ieee.org

Deep-learning methods, such as convolutional neural networks (CNNs), have been
successfully applied to seismic impedance inversion in recent years. Compared with …

被引用次数：35 相关文章所有 3 个版本

[PDF] uky.edu

Synthesizing dysarthric speech using multi-speaker tts for dysarthric speech recognition

M Soleymanpour, MT Johnson… - ICASSP 2022-2022 …, 2022 - ieeexplore.ieee.org

Dysarthria is a motor speech disorder often characterized by reduced speech intelligibility
through slow, uncoordinated control of speech production muscles. Automatic Speech …

被引用次数：25 相关文章所有 4 个版本

[PDF] usenix.org

Catch You and I Can: Revealing source voiceprint against voice conversion

J Deng, Y Chen, Y Zhong, Q Miao, X Gong… - 32nd USENIX Security …, 2023 - usenix.org

Voice conversion (VC) techniques can be abused by malicious parties to transform their
audios to sound like a target speaker, making it hard for a human being or a speaker …

被引用次数：9 相关文章所有 6 个版本

[PDF] academia.edu

SINGAN: Singing voice conversion with generative adversarial networks

B Sisman, K Vijayan, M Dong… - 2019 Asia-Pacific Signal …, 2019 - ieeexplore.ieee.org

Singing voice conversion (SVC) is a task to convert the source singer's voice to sound like
that of the target singer, without changing the lyrical content. So far, most of the voice …

被引用次数：47 相关文章所有 4 个版本

[PDF] arxiv.org

Towards General-Purpose Text-Instruction-Guided Voice Conversion

CY Kuan, CA Li, TY Hsu, TY Lin… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

This paper introduces a novel voice conversion (VC) model, guided by text instructions such
as “articulate slowly with a deep tone “or “speak in a cheerful boyish voice”. Unlike …

被引用次数：6 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Speaker anonymization by modifying fundamental frequency and x-vector singular value

CO Mawalim, K Galajit, J Karnjana, S Kidani… - Computer Speech & …, 2022 - Elsevier

Speaker anonymization is a method of protecting voice privacy by concealing individual
speaker characteristics while preserving linguistic information. The VoicePrivacy Challenge …

被引用次数：21 相关文章所有 3 个版本

[PDF] arxiv.org

Voice conversion based on cross-domain features using variational auto encoders

WC Huang, HT Hwang, YH Peng… - … on Chinese Spoken …, 2018 - ieeexplore.ieee.org

An effective approach to non-parallel voice conversion (VC) is to utilize deep neural
networks (DNNs), specifically variational auto encoders (VAEs), to model the latent structure …

被引用次数：53 相关文章所有 8 个版本

高级搜索

QQ 群