X Di, Z Chen, Y Liang, J Zheng, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Large-scale text-to-speech (TTS) models have made significant progress recently. However,
they still fall short in the generation of Chinese dialectal speech. Toaddress this, we propose …