Z Liu,
Y Guo,
K Yu - ICASSP 2023-2023 IEEE International …, 2023 - ieeexplore.ieee.org
In this work, we present DiffVoice, a novel text-to-speech model based on latent diffusion.
We propose to first encode speech signals into a phoneme-rate latent representation with a …