Emotional voice conversion: Theory, databases and ESD

K Zhou, B Sisman, R Liu, H Li - Speech Communication, 2022 - Elsevier
In this paper, we first provide a review of the state-of-the-art emotional voice conversion
research, and the existing emotional speech databases. We then motivate the development …

Limited data emotional voice conversion leveraging text-to-speech: Two-stage sequence-to-sequence training

K Zhou, B Sisman, H Li - arXiv preprint arXiv:2103.16809, 2021 - arxiv.org
Emotional voice conversion (EVC) aims to change the emotional state of an utterance while
preserving the linguistic content and speaker identity. In this paper, we propose a novel 2 …

Emotion intensity and its control for emotional voice conversion

K Zhou, B Sisman, R Rana… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Emotional voice conversion (EVC) seeks to convert the emotional state of an utterance while
preserving the linguistic content and speaker identity. In EVC, emotions are usually treated …

Converting anyone's emotion: Towards speaker-independent emotional voice conversion

K Zhou, B Sisman, M Zhang, H Li - arXiv preprint arXiv:2005.07025, 2020 - arxiv.org
Emotional voice conversion aims to convert the emotion of speech from one state to another
while preserving the linguistic content and speaker identity. The prior studies on emotional …

Seen and unseen emotional style transfer for voice conversion with a new emotional speech dataset

K Zhou, B Sisman, R Liu, H Li - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Emotional voice conversion aims to transform emotional prosody in speech while preserving
the linguistic content and speaker identity. Prior studies show that it is possible to …

Sequence-to-sequence modelling of f0 for speech emotion conversion

C Robinson, N Obin, A Roebel - ICASSP 2019-2019 IEEE …, 2019 - ieeexplore.ieee.org
Voice interfaces are becoming wildly popular and driving demand for more advanced
speech synthesis and voice transformation systems. Current text-to-speech methods …

The emotional voices database: Towards controlling the emotion dimension in voice generation systems

A Adigwe, N Tits, KE Haddad, S Ostadabbas… - arXiv preprint arXiv …, 2018 - arxiv.org
In this paper, we present a database of emotional speech intended to be open-sourced and
used for synthesis and generation purpose. It contains data for male and female actors in …

Data-driven emotion conversion in spoken English

Z Inanoglu, S Young - Speech Communication, 2009 - Elsevier
This paper describes an emotion conversion system that combines independent parameter
transformation techniques to endow a neutral utterance with a desired target emotion. A set …

Prosody conversion from neutral speech to emotional speech

J Tao, Y Kang, A Li - IEEE transactions on Audio, Speech, and …, 2006 - ieeexplore.ieee.org
Emotion is an important element in expressive speech synthesis. Unlike traditional discrete
emotion simulations, this paper attempts to synthesize emotional speech by using" strong"," …

[PDF][PDF] GMM-based emotional voice conversion using spectrum and prosody features

R Aihara, R Takashima… - American Journal of …, 2012 - me.cs.scitec.kobe-u.ac.jp
Abstract We propose Gaussian Mixture Model (GMM)-based emotional voice conversion
using spectrum and prosody features. In recent years, speech recognition and synthesis …