Accent and speaker disentanglement in many-to-many voice conversion

Z Wang, W Ge, X Wang, S Yang, W Gan… - … on Chinese Spoken …, 2021 - ieeexplore.ieee.org
This paper proposes an interesting voice and accent joint conversion approach, which can
convert an arbitrary source speaker's voice to a target speaker with non-native accent. This …

Language agnostic speaker embedding for cross-lingual personalized speech generation

Y Zhou, X Tian, H Li - IEEE/ACM Transactions on Audio …, 2021 - ieeexplore.ieee.org
Cross-lingual personalized speech generation seeks to synthesize a target speaker's voice
from only a few training samples that are in a different language. One popular technique is to …

Disentangled speech representation learning for one-shot cross-lingual voice conversion using ß-vae

H Lu, D Wang, X Wu, Z Wu, X Liu… - 2022 IEEE Spoken …, 2023 - ieeexplore.ieee.org
We propose an unsupervised learning method to disentangle speech into content
representation and speaker identity representation. We apply this method to the challenging …

Optimization of cross-lingual voice conversion with linguistics losses to reduce foreign accents

Y Zhou, Z Wu, X Tian, H Li - IEEE/ACM Transactions on Audio …, 2023 - ieeexplore.ieee.org
Cross-lingual voice conversion (XVC) transforms the speaker identity of a source speaker to
that of a target speaker who speaks a different language. Due to the intrinsic differences …

Multi-task waveRNN with an integrated architecture for cross-lingual voice conversion

Y Zhou, X Tian, H Li - IEEE Signal Processing Letters, 2020 - ieeexplore.ieee.org
Spoken languages are similar phonetically because humans have a common vocal
production system. However, each language has a unique phonetic repertoire and …

[PDF][PDF] Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation.

Y Zhou, X Tian, Z Wu, H Li - Interspeech, 2021 - isca-archive.org
Abstract Cross-Lingual Voice Conversion (XVC) aims to modify a source speaker identity
towards a target while preserving the source linguistic content. This paper introduces a cycle …

[HTML][HTML] 面向非平行语料的语音转换技术综述

李鹏程, 张旭龙, 王健宗, 程宁, 肖京 - 大数据, 2024 - infocomm-journal.com
李鹏程(1999-), 男, 中国科学技术大学硕士生, 平安科技(深圳) 有限公司算法工程师,
主要研究方向为语音合成, 语音转换和语音安全等.张旭龙(1988-), 男, 博士, 平安科技(深圳) …

[PDF][PDF] Submission from SRCB for voice conversion challenge 2020

Q Ma, R Liu, X Wen, C Lu, X Chen - Proc. Joint Workshop for the …, 2020 - isca-archive.org
This paper presents the intra-lingual and cross-lingual voice conversion system for Voice
Conversion Challenge 2020 (VCC 2020). Voice conversion (VC) modifies a source …

[PDF][PDF] Cross-Lingual Voice Conversion with Disentangled Universal Linguistic Representations.

Z Yang, W Zhang, Y Liu, X Xing - Interspeech, 2021 - isca-archive.org
Intra-lingual voice conversion has achieved great progress recently in terms of naturalness
and similarity. However, in crosslingual voice conversion, there is still an urgent need to …

RAVE for Speech: Efficient Voice Conversion at High Sampling Rates

AR Bargum, S Lajboschitz, C Erkut - arXiv preprint arXiv:2408.16546, 2024 - arxiv.org
Voice conversion has gained increasing popularity within the field of audio manipulation
and speech synthesis. Often, the main objective is to transfer the input identity to that of a …