Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset

K Zhou, B Sisman, R Liu, H Li - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Emotional voice conversion aims to transform emotional prosody in speech while preserving
the linguistic content and speaker identity. Prior studies show that it is possible to …

Converting anyone's emotion: Towards speaker-independent emotional voice conversion

K Zhou, B Sisman, M Zhang, H Li - arXiv preprint arXiv:2005.07025, 2020 - arxiv.org
Emotional voice conversion aims to convert the emotion of speech from one state to another
while preserving the linguistic content and speaker identity. The prior studies on emotional …

Transforming spectrum and prosody for emotional voice conversion with non-parallel training data

K Zhou, B Sisman, H Li - arXiv preprint arXiv:2002.00198, 2020 - arxiv.org
Emotional voice conversion aims to convert the spectrum and prosody to change the
emotional patterns of speech, while preserving the speaker identity and linguistic content …

Vaw-gan for disentanglement and recomposition of emotional elements in speech

K Zhou, B Sisman, H Li - 2021 IEEE spoken language …, 2021 - ieeexplore.ieee.org
Emotional voice conversion (EVC) aims to convert the emotion of speech from one state to
another while preserving the linguistic content and speaker identity. In this paper, we study …

Emotion intensity and its control for emotional voice conversion

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional voice conversion (EVC) seeks to convert the emotional state of an utterance while
preserving the linguistic content and speaker identity. In EVC, emotions are usually treated …

Nonparallel emotional speech conversion

J Gao, D Chakraborty, H Tembine… - arXiv preprint arXiv …, 2018 - arxiv.org
We propose a nonparallel data-driven emotional speech conversion method. It enables the
transfer of emotion-related characteristics of a speech signal while preserving the speaker's …

Textless speech emotion conversion using discrete and decomposed representations

F Kreuk, A Polyak, J Copet, E Kharitonov… - arXiv preprint arXiv …, 2021 - arxiv.org
Speech emotion conversion is the task of modifying the perceived emotion of a speech
utterance while preserving the lexical content and speaker identity. In this study, we cast the …

Sequence-to-sequence modelling of f0 for speech emotion conversion

C Robinson, N Obin, A Roebel - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
Voice interfaces are becoming wildly popular and driving demand for more advanced
speech synthesis and voice transformation systems. Current text-to-speech methods …

Stargan for emotional speech conversion: Validated by data augmentation of end-to-end emotion recognition

G Rizos, A Baird, M Elliott… - ICASSP 2020-2020 IEEE …, 2020 - ieeexplore.ieee.org
In this paper, we propose an adversarial network implementation for speech emotion
conversion as a data augmentation method, validated by a multi-class speech affect …

MelGAN-VC: Voice conversion and audio style transfer on arbitrarily long samples using spectrograms

M Pasini - arXiv preprint arXiv:1910.03713, 2019 - arxiv.org
Traditional voice conversion methods rely on parallel recordings of multiple speakers
pronouncing the same sentences. For real-world applications however, parallel data is …