High-quality nonparallel voice conversion based on cycle-consistent adversarial network

R Ferro, N Obin, A Roebel - 2020 28th European Signal …, 2021 - ieeexplore.ieee.org

This paper tackles GAN optimization and stability issues in the context of voice conversion.
First, to simplify the conversion task, we propose to use spectral envelopes as inputs …

被引用次数：8 相关文章所有 11 个版本

An analysis of performance evaluation metrics for voice conversion models

MT Akhter, P Banerjee, S Dhar… - 2022 IEEE 19th India …, 2022 - ieeexplore.ieee.org

The process of transforming a source speaker's vocal style or vocal feature to that of a target
speaker while keeping the linguistic information of the source speaker unchanged is known …

被引用次数：3 相关文章

[引用][C] Non-Parallel Many-to-Many Voice Conversion with PSR-StarGAN.

Y Li, D Xu, Y Zhang, Y Wang, B Chen - Interspeech, 2020

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

L Xue, S Yang, N Hu, D Su, L Xie - arXiv preprint arXiv:2207.00756, 2022 - arxiv.org

Building a voice conversion system for noisy target speakers, such as users providing noisy
samples or Internet found data, is a challenging task since the use of contaminated speech …

被引用次数：1 相关文章所有 5 个版本

[PDF] ieee.org

Non-parallel voice conversion system with WaveNet vocoder and collapsed speech suppression

YC Wu, PL Tobing, K Kobayashi, T Hayashi… - IEEE Access, 2020 - ieeexplore.ieee.org

In this paper, we integrate a simple non-parallel voice conversion (VC) system with a
WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The …

被引用次数：7 相关文章所有 6 个版本

[PDF] arxiv.org

Spectrum and prosody conversion for cross-lingual voice conversion with cyclegan

Z Du, K Zhou, B Sisman, H Li - 2020 Asia-Pacific Signal and …, 2020 - ieeexplore.ieee.org

Cross-lingual voice conversion aims to change source speaker's voice to sound like that of
target speaker, when source and target speakers speak different languages. It relies on …

被引用次数：7 相关文章所有 5 个版本

[PDF] hal.science

End-to-End Modeling for Speech Spoofing and Deepfake Detection

H Tak - 2023 - theses.fr

Résumé Les systèmes biométriques vocaux sont utilisés dans diverses applications pour
une authentification sécurisée. Toutefois, ces systèmes sont vulnérables aux attaques par …

被引用次数：1 相关文章所有 7 个版本

[PDF] arxiv.org

Unsupervised audiovisual synthesis via exemplar autoencoders

K Deng, A Bansal, D Ramanan - arXiv preprint arXiv:2001.04463, 2020 - arxiv.org

We present an unsupervised approach that converts the input speech of any individual into
audiovisual streams of potentially-infinitely many output speakers. Our approach builds on …

被引用次数：7 相关文章所有 6 个版本

Singing voice conversion based on wd-gan algorithm

W Zhao, W Wang, Y Sun, T Tang - 2019 IEEE 4th Advanced …, 2019 - ieeexplore.ieee.org

The research of singing voice conversion (SVC) has attracted more and more attention in
the field of artificial intelligence. This paper proposes a WD-GAN algorithm for singing voice …

被引用次数：6 相关文章

A deep learning based method for blind recognition of LDPC codes

L Li, L Xie, Z Huang, C Liu, J Zhou… - 2021 2nd Information …, 2021 - ieeexplore.ieee.org

Deep learning is an emerging research direction in machine learning, which has a
promising application in the field of communications. In this paper, we focus on adaptive …

被引用次数：3 相关文章

高级搜索

QQ 群