CM Chien, H Lee - arXiv e-prints, 2020 - ui.adsabs.harvard.edu
Prosody modeling is an essential component in modern text-to-speech (TTS) frameworks. By explicitly providing prosody features to the TTS model, the style of synthesized utterances …
CM Chien, H Lee - arXiv preprint arXiv:2011.06465, 2020 - arxiv.org