VQVAE Unsupervised Unit Discovery and Multi-scale Code2Spec Inverter for Zerospeech Challenge 2019 A Tjandra, B Sisman, M Zhang, S Sakti, H Li, S Nakamura arXiv preprint arXiv:1905.11449, 2019 | 86 | 2019 |
Joint Training Framework for Text-to-Speech and Voice Conversion Using Multi-Source Tacotron and WaveNet M Zhang, X Wang, F Fang, H Li, J Yamagishi Interspeech, 1298-1302, 2019 | 73 | 2019 |
A Voice Conversion Framework with Tandem Feature Sparse Representation and Speaker-Adapted WaveNet Vocoder. B Sisman, M Zhang, H Li Interspeech, 1978-1982, 2018 | 60 | 2018 |
Converting Anyone's Emotion: Towards Speaker-Independent Emotional Voice Conversion K Zhou, B Sisman, M Zhang, H Li arXiv preprint arXiv:2005.07025, 2020 | 57 | 2020 |
Transfer learning from speech synthesis to voice conversion with non-parallel training data M Zhang, Y Zhou, L Zhao, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 29, 1290-1302, 2021 | 55 | 2021 |
Group Sparse Representation With WaveNet Vocoder Adaptation for Spectrum and Prosody Conversion B Sisman, M Zhang, H Li IEEE/ACM Transactions on Audio, Speech, and Language Processing 27 (6), 1085 …, 2019 | 51 | 2019 |
Adaptive wavenet vocoder for residual compensation in gan-based voice conversion B Sisman, M Zhang, S Sakti, H Li, S Nakamura 2018 IEEE Spoken Language Technology Workshop (SLT), 282-289, 2018 | 45 | 2018 |
On the study of generative adversarial networks for cross-lingual voice conversion B Sisman, M Zhang, M Dong, H Li 2019 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2019 | 39 | 2019 |
Deepconversion: Voice conversion with limited parallel training data M Zhang, B Sisman, L Zhao, H Li Speech Communication 122, 31-43, 2020 | 16 | 2020 |
Error reduction network for dblstm-based voice conversion M Zhang, B Sisman, SS Rallabandi, H Li, L Zhao 2018 Asia-Pacific Signal and Information Processing Association Annual …, 2018 | 16 | 2018 |
VisualTTS: TTS with accurate lip-speech synchronization for automatic voice over J Lu, B Sisman, R Liu, M Zhang, H Li ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022 | 11 | 2022 |
The NUS & NWPU System for Voice Conversion Challenge 2020 X Tian, Z Wang, S Yang, X Zhou, H Du, Y Zhou, M Zhang, K Zhou, ... Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | 10 | 2020 |
Accented Text-to-Speech Synthesis with Limited Data X Zhou, M Zhang, Y Zhou, Z Wu, H Li arXiv preprint arXiv:2305.04816, 2023 | 6 | 2023 |
TTS-Guided Training for Accent Conversion Without Parallel Data Y Zhou, Z Wu, M Zhang, X Tian, H Li IEEE Signal Processing Letters, 2023 | 5 | 2023 |
Towards Zero-Shot Multi-Speaker Multi-Accent Text-to-Speech Synthesis M Zhang, X Zhou, Z Wu, H Li IEEE Signal Processing Letters, 2023 | 2 | 2023 |
Zero-shot multi-speaker accent TTS with limited accent data M Zhang, Y Zhou, Z Wu, H Li 2023 Asia Pacific Signal and Information Processing Association Annual …, 2023 | 1 | 2023 |
High-Quality Automatic Voice Over with Accurate Alignment: Supervision through Self-Supervised Discrete Speech Units J Lu, B Sisman, M Zhang, H Li arXiv preprint arXiv:2306.17005, 2023 | 1 | 2023 |
DeepA: A Deep Neural Analyzer For Speech And Singing Vocoding S Nikonorov, B Sisman, M Zhang, H Li 2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU …, 2021 | 1 | 2021 |
Speech Recognition and Synthesis Algorithm for Digital Hearing Aids under Background Noise MY Zhang, CR Zou, RY Liang, L Zhao 2016 International Conference on Information System and Artificial …, 2016 | 1 | 2016 |
NUS-HLT System for Blizzard Challenge 2020 Y Zhou, X Tian, X Zhou, M Zhang, G Lee, R Liu, B Sisman, H Li, I Pillar Proc. Joint Workshop for the Blizzard Challenge and Voice Conversion …, 2020 | | 2020 |