High-quality nonparallel voice conversion based on cycle-consistent adversarial network

OA Shaaban, R Yildirim, AA Alguttar - IEEE Access, 2023 - ieeexplore.ieee.org

This paper presents a review of techniques involved in the creation and detection of audio
deepfakes, the first section provides information about general deep fakes. In the second …

被引用次数：9 相关文章所有 3 个版本

Seismic impedance inversion based on geophysical-guided cycle-consistent generative adversarial networks

H Zhang, G Zhang, J Gao, S Li, J Zhang… - Journal of Petroleum …, 2022 - Elsevier

Deep learning algorithms have shown great potential in geophysical areas such as seismic
interpretation and seismic inversion. However, when applied to seismic inversion, high …

被引用次数：11 相关文章所有 2 个版本

[PDF] a-star.edu.sg

On the study of generative adversarial networks for cross-lingual voice conversion

B Sisman, M Zhang, M Dong… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

Cross-lingual voice conversion (VC) aims to convert the source speaker's voice to sound like
that of the target speaker, when the source and target speakers speak different languages …

被引用次数：39 相关文章所有 4 个版本

SpeakerGAN: Speaker identification with conditional generative adversarial network

L Chen, Y Liu, W Xiao, Y Wang, H Xie - Neurocomputing, 2020 - Elsevier

Current methods based on the traditional i-vectors and deep neural network (DNN) have
shown effectiveness on the speaker identification task, especially with the corpus of large …

被引用次数：29 相关文章

[PDF] arxiv.org

crank: An open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder

K Kobayashi, WC Huang, YC Wu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org

In this paper, we present an open-source software for developing a nonparallel voice
conversion (VC) system named crank. Although we have released an open-source VC …

被引用次数：21 相关文章所有 3 个版本

[PDF] arxiv.org

Voice conversion for whispered speech synthesis

M Cotescu, T Drugman, G Huybrechts… - IEEE Signal …, 2019 - ieeexplore.ieee.org

We present an approach to synthesize whisper by applying a handcrafted signal processing
recipe and Voice Conversion (VC) techniques to convert normally phonated speech to …

被引用次数：35 相关文章所有 8 个版本

[HTML] springer.com

[HTML][HTML] Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia

G Atkar, P Jayaraju - Neural Computing and Applications, 2021 - Springer

Children learn and develop their abilities at their own pace. One of the most basic skills that
they acquire is reading. However, some children struggle with reading longer than their …

被引用次数：20 相关文章所有 8 个版本

[PDF] arxiv.org

Low-resource domain adaptation for speaker recognition using cycle-gans

PS Nidadavolu, S Kataria, J Villalba… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org

Current speaker recognition technology provides great performance with the x-vector
approach. However, performance decreases when the evaluation domain is different from …

被引用次数：33 相关文章所有 4 个版本

Cycle-gans for domain adaptation of acoustic features for speaker recognition

PS Nidadavolu, J Villalba… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org

It is well known that domain mismatch between the training and evaluation data hinders the
performance of any machine learning system. Various factors contribute to domain …

被引用次数：35 相关文章

Analysis of gender and identity issues in depression detection on de-identified speech

P Lopez-Otero, L Docio-Fernandez - Computer Speech & Language, 2021 - Elsevier

Research in the area of automatic monitoring of emotional state from speech permits
envisaging future novel applications for the remote monitoring of some common mental …

被引用次数：25 相关文章

高级搜索

QQ 群