Rhythm-flexible voice conversion without parallel data using cycle-gan over phoneme posteriorgram sequences

C Yeh, P Hsu, J Chou, H Lee… - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Speaking rate refers to the average number of phonemes within some unit time, while the
rhythmic patterns refer to duration distributions for realizations of different phonemes within …

[PDF][PDF] RHYTHM-FLEXIBLE VOICE CONVERSION WITHOUT PARALLEL DATA USING CYCLE-GAN OVER PHONEME POSTERIORGRAM SEQUENCES

C Yeh, P Hsu, J Chou, H Lee, L Lee - speech.ee.ntu.edu.tw
Speaking rate refers to the average number of phonemes within some unit time, while the
rhythmic patterns refer to duration distributions for realizations of different phonemes within …

[引用][C] Rhythm-Flexible Voice Conversion Without Parallel Data Using Cycle-GAN Over Phoneme Posteriorgram Sequences

C Yeh, P Hsu, J Chou, H Lee, L Lee - 2018 IEEE Spoken Language …, 2018 - cir.nii.ac.jp
Rhythm-Flexible Voice Conversion Without Parallel Data Using Cycle-GAN Over Phoneme
Posteriorgram Sequences | CiNii Research CiNii 国立情報学研究所 学術情報ナビゲータ[サイニィ] …

Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

C Yeh, P Hsu, J Chou, H Lee, L Lee - arXiv e-prints, 2018 - ui.adsabs.harvard.edu
Speaking rate refers to the average number of phonemes within some unit time, while the
rhythmic patterns refer to duration distributions for realizations of different phonemes within …

Rhythm-Flexible Voice Conversion without Parallel Data Using Cycle-GAN over Phoneme Posteriorgram Sequences

C Yeh, P Hsu, J Chou, H Lee, L Lee - arXiv preprint arXiv:1808.03113, 2018 - arxiv.org
Speaking rate refers to the average number of phonemes within some unit time, while the
rhythmic patterns refer to duration distributions for realizations of different phonemes within …

[PDF][PDF] RHYTHM-FLEXIBLE VOICE CONVERSION WITHOUT PARALLEL DATA USING CYCLE-GAN OVER PHONEME POSTERIORGRAM SEQUENCES

C Yeh, P Hsu, J Chou, H Lee, L Lee - speech.ee.ntu.edu.tw
Speaking rate refers to the average number of phonemes within some unit time, while the
rhythmic patterns refer to duration distributions for realizations of different phonemes within …