Cyclegan voice conversion of spectral envelopes using adversarial weights

R Ferro, N Obin, A Roebel - 2020 28th European Signal …, 2021 - ieeexplore.ieee.org
This paper tackles GAN optimization and stability issues in the context of voice conversion.
First, to simplify the conversion task, we propose to use spectral envelopes as inputs …

An analysis of performance evaluation metrics for voice conversion models

MT Akhter, P Banerjee, S Dhar… - 2022 IEEE 19th India …, 2022 - ieeexplore.ieee.org
The process of transforming a source speaker's vocal style or vocal feature to that of a target
speaker while keeping the linguistic information of the source speaker unchanged is known …

[引用][C] Non-Parallel Many-to-Many Voice Conversion with PSR-StarGAN.

Y Li, D Xu, Y Zhang, Y Wang, B Chen - Interspeech, 2020

Learning Noise-independent Speech Representation for High-quality Voice Conversion for Noisy Target Speakers

L Xue, S Yang, N Hu, D Su, L Xie - arXiv preprint arXiv:2207.00756, 2022 - arxiv.org
Building a voice conversion system for noisy target speakers, such as users providing noisy
samples or Internet found data, is a challenging task since the use of contaminated speech …

Non-parallel voice conversion system with WaveNet vocoder and collapsed speech suppression

YC Wu, PL Tobing, K Kobayashi, T Hayashi… - IEEE Access, 2020 - ieeexplore.ieee.org
In this paper, we integrate a simple non-parallel voice conversion (VC) system with a
WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The …

Spectrum and prosody conversion for cross-lingual voice conversion with cyclegan

Z Du, K Zhou, B Sisman, H Li - 2020 Asia-Pacific Signal and …, 2020 - ieeexplore.ieee.org
Cross-lingual voice conversion aims to change source speaker's voice to sound like that of
target speaker, when source and target speakers speak different languages. It relies on …

End-to-End Modeling for Speech Spoofing and Deepfake Detection

H Tak - 2023 - theses.fr
Résumé Les systèmes biométriques vocaux sont utilisés dans diverses applications pour
une authentification sécurisée. Toutefois, ces systèmes sont vulnérables aux attaques par …

Unsupervised audiovisual synthesis via exemplar autoencoders

K Deng, A Bansal, D Ramanan - arXiv preprint arXiv:2001.04463, 2020 - arxiv.org
We present an unsupervised approach that converts the input speech of any individual into
audiovisual streams of potentially-infinitely many output speakers. Our approach builds on …

Singing voice conversion based on wd-gan algorithm

W Zhao, W Wang, Y Sun, T Tang - 2019 IEEE 4th Advanced …, 2019 - ieeexplore.ieee.org
The research of singing voice conversion (SVC) has attracted more and more attention in
the field of artificial intelligence. This paper proposes a WD-GAN algorithm for singing voice …

A deep learning based method for blind recognition of LDPC codes

L Li, L Xie, Z Huang, C Liu, J Zhou… - 2021 2nd Information …, 2021 - ieeexplore.ieee.org
Deep learning is an emerging research direction in machine learning, which has a
promising application in the field of communications. In this paper, we focus on adaptive …