相关文章- 学术资源搜索

Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations

J Chou, C Yeh, H Lee, L Lee - arXiv preprint arXiv:1804.02812, 2018 - arxiv.org

Recently, cycle-consistent adversarial network (Cycle-GAN) has been successfully applied
to voice conversion to a different speaker without parallel data, although in those …

被引用次数：153 相关文章所有 9 个版本

[PDF] arxiv.org

Voice conversion from unaligned corpora using variational autoencoding wasserstein generative adversarial networks

CC Hsu, HT Hwang, YC Wu, Y Tsao… - arXiv preprint arXiv …, 2017 - arxiv.org

Building a voice conversion (VC) system from non-parallel speech corpora is challenging
but highly valuable in real application scenarios. In most situations, the source and the target …

被引用次数：452 相关文章所有 11 个版本

[PDF] arxiv.org

Stargan-vc: Non-parallel many-to-many voice conversion using star generative adversarial networks

H Kameoka, T Kaneko, K Tanaka… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org

This paper proposes a method that allows non-parallel many-to-many voice conversion (VC)
by using a variant of a generative adversarial network (GAN) called StarGAN. Our method …

被引用次数：450 相关文章所有 5 个版本

[PDF] arxiv.org

Speech representation disentanglement with adversarial mutual information learning for one-shot voice conversion

SC Yang, M Tantrawenith, H Zhuang, Z Wu… - arXiv preprint arXiv …, 2022 - arxiv.org

One-shot voice conversion (VC) with only a single target speaker's speech for reference has
become a hot research topic. Existing works generally disentangle timbre, while information …

被引用次数：25 相关文章所有 6 个版本

[PDF] ntt.co.jp

[PDF][PDF] Sequence-to-Sequence Voice Conversion with Similarity Metric Learned Using Generative Adversarial Networks.

T Kaneko, H Kameoka, K Hiramatsu, K Kashino - Interspeech, 2017 - kecl.ntt.co.jp

We propose a training framework for sequence-to-sequence voice conversion (SVC). A well-
known problem regarding a conventional VC framework is that acoustic-feature sequences …

被引用次数：132 相关文章所有 3 个版本

[PDF] arxiv.org

S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations

J Lin, YY Lin, CM Chien, H Lee - arXiv preprint arXiv:2104.02901, 2021 - arxiv.org

Any-to-any voice conversion (VC) aims to convert the timbre of utterances from and to any
speakers seen or unseen during training. Various any-to-any VC approaches have been …

被引用次数：53 相关文章所有 6 个版本

[PDF] arxiv.org

Cyclegan-vc2: Improved cyclegan-based non-parallel voice conversion

T Kaneko, H Kameoka, K Tanaka… - ICASSP 2019-2019 …, 2019 - ieeexplore.ieee.org

Non-parallel voice conversion (VC) is a technique for learning the mapping from source to
target speech without relying on parallel data. This is an important task, but it has been …

被引用次数：311 相关文章所有 7 个版本

[PDF] researchgate.net

High-quality nonparallel voice conversion based on cycle-consistent adversarial network

F Fang, J Yamagishi, I Echizen… - … on acoustics, speech …, 2018 - ieeexplore.ieee.org

Although voice conversion (VC) algorithms have achieved remarkable success along with
the development of machine learning, superior performance is still difficult to achieve when …

被引用次数：164 相关文章所有 9 个版本

[PDF] ieee.org

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion

H Kameoka, K Tanaka, D Kwaśny… - … on audio, speech …, 2020 - ieeexplore.ieee.org

This article proposes a voice conversion (VC) method using sequence-to-sequence
(seq2seq or S2S) learning, which flexibly converts not only the voice characteristics but also …

被引用次数：71 相关文章所有 6 个版本

[PDF] ntt.co.jp

Cyclegan-vc: Non-parallel voice conversion using cycle-consistent adversarial networks

T Kaneko, H Kameoka - 2018 26th European Signal …, 2018 - ieeexplore.ieee.org

We propose a non-parallel voice-conversion (VC) method that can learn a mapping from
source to target speech without relying on parallel data. The proposed method is particularly …

被引用次数：327 相关文章所有 8 个版本

高级搜索

QQ 群

Multi-target voice conversion without parallel data by adversarially learning disentangled audio representations

Voice conversion from unaligned corpora using variational autoencoding wasserstein generative adversarial networks

Stargan-vc: Non-parallel many-to-many voice conversion using star generative adversarial networks

Speech representation disentanglement with adversarial mutual information learning for one-shot voice conversion

[PDF][PDF] Sequence-to-Sequence Voice Conversion with Similarity Metric Learned Using Generative Adversarial Networks.

S2VC: A framework for any-to-any voice conversion with self-supervised pretrained representations

Cyclegan-vc2: Improved cyclegan-based non-parallel voice conversion

High-quality nonparallel voice conversion based on cycle-consistent adversarial network

ConvS2S-VC: Fully convolutional sequence-to-sequence voice conversion

Cyclegan-vc: Non-parallel voice conversion using cycle-consistent adversarial networks

引用