Non-parallel voice conversion system with wavenet vocoder and collapsed speech suppression

YC Wu, PL Tobing, K Kobayashi, T Hayashi… - IEEE Access, 2020 - ieeexplore.ieee.org
In this paper, we integrate a simple non-parallel voice conversion (VC) system with a
WaveNet (WN) vocoder and a proposed collapsed speech suppression technique. The …

[PDF][PDF] System description: Speaker anonymization by pitch shifting based on time-scale modification (pv-tsm)

CO Mawalim, S Okada, M Unoki - URL: https://www …, 2022 - voiceprivacychallenge.org
The increasing usage of speech in digital technology raises a privacy issue because speech
contains biometric information. Several methods of dealing with this issue have been …

Dysarthric Speech Enhancement Based on Convolution Neural Network

SS Wang, Y Tsao, WZ Zheng, HW Yeh… - 2022 44th Annual …, 2022 - ieeexplore.ieee.org
Generally, those patients with dysarthria utter a distorted sound and the restrained
intelligibility of a speech for both human and machine. To enhance the intelligibility of …

From signal representation to representation learning: structured modeling of speech signals

N Obin - 2023 - hal.science
This habilitation presents the last ten years of my research on the structured modelling of
speech signals. Speech, as an oral language, constitutes the most elaborate communication …

Singing voice conversion based on wd-gan algorithm

W Zhao, W Wang, Y Sun, T Tang - 2019 IEEE 4th Advanced …, 2019 - ieeexplore.ieee.org
The research of singing voice conversion (SVC) has attracted more and more attention in
the field of artificial intelligence. This paper proposes a WD-GAN algorithm for singing voice …

Towards the development of accent conversion model for (l1) bengali speaker using cycle consistent adversarial network (cyclegan)

S Chandra, P Bharati… - 2022 25th Conference of …, 2022 - ieeexplore.ieee.org
The goal of foreign accent conversion (FAC) is to create a new voice with the voice identity
of a given second-language (L2) speaker but with a native (L1) accent. The main motivation …

Novel inter mixture weighted GMM posteriorgram for DNN and GAN-based voice conversion

NJ Shah, R Sreeraj, N Shah… - 2018 Asia-Pacific Signal …, 2018 - ieeexplore.ieee.org
Voice Conversion (VC) requires an alignment of the spectral features before learning the
mapping function, due to the speaking rate variations across the source and target speakers …

[PDF][PDF] A hybrid CNN-LSTM model with adaptive instance normalization for one shot singing voice conversion

A Yousuf, DS George - AIMS Electronics and Electrical Engineering, 2024 - aimspress.com
Singing voice conversion methods encounter challenges in achieving a delicate balance
between synthesis quality and singer similarity. Traditional voice conversion techniques …

CycleGAN-Based Singing/Humming to Instrument Conversion Technique

WH Lai, SL Wang, ZY Xu - Electronics, 2022 - mdpi.com
In this research, singing/humming to instrument conversion techniques are proposed. In
humming to instrument, two models based on cycle-consistent adversarial networks …

Unsupervised Musical Timbre Transfer for Notification Sounds

J Yang, T Cinquin, G Sörös - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
We present a method to transform artificial notification sounds into various musical timbres.
To tackle the issues of ambiguous timbre definition, the lack of paired notification-music …