Cross-lingual voice conversion with bilingual phonetic posteriorgram and average modeling Y Zhou, X Tian, H Xu, RK Das, H Li ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 85 | 2019 |
Transfer learning from speech synthesis to voice conversion with non-parallel training data M Zhang, Y Zhou, L Zhao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1290-1302, 2021 | 57 | 2021 |
Language agnostic speaker embedding for cross-lingual personalized speech generation Y Zhou, X Tian, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 3427-3439, 2021 | 16 | 2021 |
Speaker-independent spectral mapping for speech-to-singing conversion X Gao, X Tian, RK Das, Y Zhou, H Li 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 14 | 2019 |
A modularized neural network with language-specific output layers for cross-lingual voice conversion Y Zhou, X Tian, E Yılmaz, RK Das, H Li 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 13 | 2019 |
Multi-task waveRNN with an integrated architecture for cross-lingual voice conversion Y Zhou, X Tian, H Li IEEE Signal Processing Letters 27, 1310-1314, 2020 | 11 | 2020 |
Personalized Singing Voice Generation Using WaveRNN. X Gao, X Tian, Y Zhou, RK Das, H Li Odyssey, 252-258, 2020 | 11 | 2020 |
Many-to-many cross-lingual voice conversion with a jointly trained speaker embedding network Y Zhou, X Tian, RK Das, H Li 2019 Asia-Pacific Signal and Information Processing Association Annual …, 2019 | 11 | 2019 |
The NUS & NWPU system for voice conversion challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 10 | 2020 |
Accented text-to-speech synthesis with limited data X Zhou, M Zhang, Y Zhou, Z Wu, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 32, 1699-1711, 2024 | 6 | 2024 |
Cross-Lingual Voice Conversion with a Cycle Consistency Loss on Linguistic Representation. Y Zhou, X Tian, Z Wu, H Li Interspeech, 1374-1378, 2021 | 6 | 2021 |
Tts-guided training for accent conversion without parallel data Y Zhou, Z Wu, M Zhang, X Tian, H Li IEEE Signal Processing Letters 30, 533-537, 2023 | 5 | 2023 |
Optimization of cross-lingual voice conversion with linguistics losses to reduce foreign accents Y Zhou, Z Wu, X Tian, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 31, 1916-1926, 2023 | 3 | 2023 |
Mel-S3R: Combining Mel-spectrogram and self-supervised speech representation with VQ-VAE for any-to-any voice conversion J Yang, Y Zhou, H Huang Speech Communication 151, 52-63, 2023 | 2 | 2023 |
Combining wav2vec 2.0 fine-tuning and ConLearnNet for speech emotion recognition C Sun, Y Zhou, X Huang, J Yang, X Hou Electronics 13 (6), 1103, 2024 | 1 | 2024 |
Zero-shot multi-speaker accent TTS with limited accent data M Zhang, Y Zhou, Z Wu, H Li 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 1 | 2023 |
RefXVC: Cross-Lingual Voice Conversion with Enhanced Reference Leveraging M Zhang, Y Zhou, Y Ren, C Zhang, X Yin, H Li arXiv preprint arXiv:2406.16326, 2024 | | 2024 |
Multi-Scale Accent Modeling with Disentangling for Multi-Speaker Multi-Accent TTS Synthesis X Zhou, M Zhang, Y Zhou, Z Wu, H Li arXiv preprint arXiv:2406.10844, 2024 | | 2024 |
Cross-Lingual Voice Conversion Y Zhou PQDT-Global, 2022 | | 2022 |
Mel-S3r: Combining Mel-Spectrogram and Self-Supervised Speech Representation for Voice Conversion J Yang, Y Zhou, H Huang Available at SSRN 4183301, 0 | | |