An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

[HTML][HTML] Overview of voice conversion methods based on deep learning

T Walczyna, Z Piotrowski - Applied sciences, 2023 - mdpi.com
Voice conversion is a process where the essence of a speaker's identity is seamlessly
transferred to another speaker, all while preserving the content of their speech. This usage is …

[PDF][PDF] sprocket: Open-Source Voice Conversion Software.

K Kobayashi, T Toda - Odyssey, 2018 - easychair.org
Statistical voice conversion (VC) is a technique to convert specific non-or paralinguistic
information while keeping linguistic information unchanged, and speaker conversion has …

The voice conversion challenge 2018: Promoting development of parallel and nonparallel methods

J Lorenzo-Trueba, J Yamagishi, T Toda, D Saito… - arXiv preprint arXiv …, 2018 - arxiv.org
We present the Voice Conversion Challenge 2018, designed as a follow up to the 2016
edition with the aim of providing a common framework for evaluating and comparing …

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion

H Kameoka, K Tanaka, D Kwaśny… - … on audio, speech …, 2020 - ieeexplore.ieee.org
This article proposes a voice conversion (VC) method using sequence-to-sequence
(seq2seq or S2S) learning, which flexibly converts not only the voice characteristics but also …

The singing voice conversion challenge 2023

WC Huang, LP Violeta, S Liu, J Shi… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org
We present the latest iteration of the voice conversion challenge (VCC) series, a bi-annual
scientific event aiming to compare and understand different voice conversion (VC) systems …

[PDF][PDF] The Voice Conversion Challenge 2016.

T Toda, LH Chen, D Saito, F Villavicencio, M Wester… - Interspeech, 2016 - isca-archive.org
This paper describes the Voice Conversion Challenge 2016 devised by the authors to better
understand different voice conversion (VC) techniques by comparing their performance on a …

The sequence-to-sequence baseline for the voice conversion challenge 2020: Cascading asr and tts

WC Huang, T Hayashi, S Watanabe, T Toda - arXiv preprint arXiv …, 2020 - arxiv.org
This paper presents the sequence-to-sequence (seq2seq) baseline system for the voice
conversion challenge (VCC) 2020. We consider a naive approach for voice conversion (VC) …

Sequence-to-sequence acoustic modeling for voice conversion

JX Zhang, ZH Ling, LJ Liu, Y Jiang… - IEEE/ACM Transactions …, 2019 - ieeexplore.ieee.org
In this paper, a neural network named sequence-to-sequence ConvErsion NeTwork
(SCENT) is presented for acoustic modeling in voice conversion. At training stage, a SCENT …

Voice conversion using deep neural networks with layer-wise generative training

LH Chen, ZH Ling, LJ Liu, LR Dai - IEEE/ACM Transactions on …, 2014 - ieeexplore.ieee.org
This paper presents a new spectral envelope conversion method using deep neural
networks (DNNs). The conventional joint density Gaussian mixture model (JDGMM) based …