An overview of voice conversion and its challenges: From statistical modeling to deep learning

B Sisman, J Yamagishi, S King… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Speaker identity is one of the important characteristics of human speech. In voice
conversion, we change the speaker identity from one to another, while keeping the linguistic …

An overview of voice conversion systems

SH Mohammadi, A Kain - Speech Communication, 2017 - Elsevier
Voice transformation (VT) aims to change one or more aspects of a speech signal while
preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to …

Parallel-data-free voice conversion using cycle-consistent adversarial networks

T Kaneko, H Kameoka - arXiv preprint arXiv:1711.11293, 2017 - arxiv.org
We propose a parallel-data-free voice-conversion (VC) method that can learn a mapping
from source to target speech without relying on parallel data. The proposed method is …

[PDF][PDF] A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder.

B Sisman, M Zhang, H Li - Interspeech, 2018 - isca-archive.org
A voice conversion system typically consists of two modules, the feature conversion module
that is followed by a vocoder. The exemplar-based sparse representation marks a success …

Sparse representation of phonetic features for voice conversion with and without parallel data

B Çişman, H Li, KC Tan - 2017 IEEE Automatic Speech …, 2017 - ieeexplore.ieee.org
This paper presents a voice conversion framework that uses phonetic information in an
exemplar-based voice conversion approach. The proposed idea is motivated by the fact that …

Transformation of prosody in voice conversion

B Şişman, H Li, KC Tan - 2017 Asia-Pacific Signal and …, 2017 - ieeexplore.ieee.org
Voice Conversion (VC) aims to convert one's voice to sound like that of another. So far, most
of the voice conversion frameworks mainly focus only on the conversion of spectrum. We …

Digital Speech Makeup: Voice Conversion Based Altered Auditory Feedback for Transforming Self-Representation

R Arakawa, Z Kashino, S Takamichi… - Proceedings of the …, 2021 - dl.acm.org
Makeup (ie, cosmetics) has long been used to transform not only one's appearance but also
their self-representation. Previous studies have demonstrated that visual transformations …

[PDF][PDF] Phonetically Aware Exemplar-Based Prosody Transformation.

B Sisman, G Lee, H Li - Odyssey, 2018 - isca-archive.org
In this paper, we propose a novel prosody transformation framework for voice conversion by
making use of phonetic information. The proposed framework is motivated by two …

Machine learning for limited data voice conversion

B Sisman - 2019 - search.proquest.com
Voice Conversion aims to convert one's voice to sound like that of another. This thesis is
focused on developing advanced machine learning algorithms and frameworks for voice …

Hiding Sensitive Information in Desensitized Voice Sequences

H Huang, J Zhang, D Chen… - … Conference on Data …, 2023 - ieeexplore.ieee.org
Voice data is broadly acquired and utilized by consumer services. In order to process such
data, most of the raw records are sent to web servers, possibly with dedicated acceleration …