Audio Deepfake Approaches

OA Shaaban, R Yildirim, AA Alguttar - IEEE Access, 2023 - ieeexplore.ieee.org
This paper presents a review of techniques involved in the creation and detection of audio
deepfakes, the first section provides information about general deep fakes. In the second …

Seismic impedance inversion based on geophysical-guided cycle-consistent generative adversarial networks

H Zhang, G Zhang, J Gao, S Li, J Zhang… - Journal of Petroleum …, 2022 - Elsevier
Deep learning algorithms have shown great potential in geophysical areas such as seismic
interpretation and seismic inversion. However, when applied to seismic inversion, high …

On the study of generative adversarial networks for cross-lingual voice conversion

B Sisman, M Zhang, M Dong… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
Cross-lingual voice conversion (VC) aims to convert the source speaker's voice to sound like
that of the target speaker, when the source and target speakers speak different languages …

SpeakerGAN: Speaker identification with conditional generative adversarial network

L Chen, Y Liu, W Xiao, Y Wang, H Xie - Neurocomputing, 2020 - Elsevier
Current methods based on the traditional i-vectors and deep neural network (DNN) have
shown effectiveness on the speaker identification task, especially with the corpus of large …

crank: An open-source software for nonparallel voice conversion based on vector-quantized variational autoencoder

K Kobayashi, WC Huang, YC Wu… - ICASSP 2021-2021 …, 2021 - ieeexplore.ieee.org
In this paper, we present an open-source software for developing a nonparallel voice
conversion (VC) system named crank. Although we have released an open-source VC …

Voice conversion for whispered speech synthesis

M Cotescu, T Drugman, G Huybrechts… - IEEE Signal …, 2019 - ieeexplore.ieee.org
We present an approach to synthesize whisper by applying a handcrafted signal processing
recipe and Voice Conversion (VC) techniques to convert normally phonated speech to …

[HTML][HTML] Speech synthesis using generative adversarial network for improving readability of Hindi words to recuperate from dyslexia

G Atkar, P Jayaraju - Neural Computing and Applications, 2021 - Springer
Children learn and develop their abilities at their own pace. One of the most basic skills that
they acquire is reading. However, some children struggle with reading longer than their …

Low-resource domain adaptation for speaker recognition using cycle-gans

PS Nidadavolu, S Kataria, J Villalba… - 2019 IEEE Automatic …, 2019 - ieeexplore.ieee.org
Current speaker recognition technology provides great performance with the x-vector
approach. However, performance decreases when the evaluation domain is different from …

Cycle-gans for domain adaptation of acoustic features for speaker recognition

PS Nidadavolu, J Villalba… - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
It is well known that domain mismatch between the training and evaluation data hinders the
performance of any machine learning system. Various factors contribute to domain …

Analysis of gender and identity issues in depression detection on de-identified speech

P Lopez-Otero, L Docio-Fernandez - Computer Speech & Language, 2021 - Elsevier
Research in the area of automatic monitoring of emotional state from speech permits
envisaging future novel applications for the remote monitoring of some common mental …