Grad-tts: A diffusion probabilistic model for text-to-speech V Popov, I Vovk, V Gogoryan, T Sadekova, M Kudinov International Conference on Machine Learning, 8599-8608, 2021 | 552 | 2021 |
Diffusion-based voice conversion with fast maximum likelihood sampling scheme V Popov, I Vovk, V Gogoryan, T Sadekova, M Kudinov, J Wei arXiv preprint arXiv:2109.13821, 2021 | 122 | 2021 |
A Unified System for Voice Cloning and Voice Conversion through Diffusion Probabilistic Modeling T Sadekova, V Gogoryan, I Vovk, V Popov, M Kudinov, J Wei Proc. Interspeech 2022, 3003-3007, 2022 | 11 | 2022 |
Fast Grad-TTS: Towards Efficient Diffusion-Based Speech Generation on CPU. I Vovk, T Sadekova, V Gogoryan, V Popov, MA Kudinov, J Wei Interspeech, 838-842, 2022 | 9 | 2022 |
Optimal transport in diffusion modeling for conversion tasks in audio domain V Popov, A Amatov, M Kudinov, V Gogoryan, T Sadekova, I Vovk ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023 | 6 | 2023 |
A Semi-Supervised Deep Learning Approach to Dataset Collection for Query-by-Humming Task. A Amatov, D Lamanov, M Titov, I Vovk, I Makarov, MA Kudinov ISMIR, 649-656, 2023 | 1 | 2023 |
Efficient Strategies of Few-Shot On-Device Voice Cloning T Sadekova, V Popov, V Gogoryan, I Vovk, A Drogolyub, D Polubotko, ... | | |