Voice conversion aims to modify the source speaker's voice to resemble the target speaker while preserving the original speech content. Despite notable advancements in voice …
In this paper, we propose GlowVC: a multilingual multi-speaker flow-based model for language-independent text-free voice conversion. We build on Glow-TTS, which provides an …
Y Zhou, Z Wu, X Tian, H Li - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Cross-lingual voice conversion (XVC) transforms the speaker identity of a source speaker to that of a target speaker who speaks a different language. Due to the intrinsic differences …
H Guo, C Liu, CT Ishi, H Ishiguro - 2023 IEEE Automatic Speech …, 2023 - ieeexplore.ieee.org
Voice conversion systems have made significant advancements in terms of naturalness and similarity in common voice conversion tasks. However, their performance in more complex …
A Avdeeva, A Gusev - arXiv preprint arXiv:2408.11528, 2024 - arxiv.org
Zero-shot voice conversion aims to transfer the voice of a source speaker to that of a speaker unseen during training, while preserving the content information. Although various …
This paper presents a method for end-to-end cross-lingual text-to-speech (TTS) which aims to preserve the target language's pronunciation regardless of the original speaker's …
Cross-Lingual Voice Conversion (XVC) aims to change the identity of a source speaker towards a target speaker while preserving the content. In particular, the source and …