A Demonstration of the Merlin Open Source Neural Network Speech Synthesis System S Ronanki, Z Wu, O Watts, S King SSW9, 133, 2016 | 422 | 2016 |
ASVspoof 2019: A large-scale public database of synthesized, converted and replayed speech X Wang, J Yamagishi, M Todisco, H Delgado, A Nautsch, N Evans, ... Computer Speech & Language 64, 101114, 2020 | 320 | 2020 |
Fine-grained robust prosody transfer for single-speaker neural text-to-speech V Klimkov, S Ronanki, J Rohnke, T Drugman arXiv preprint arXiv:1907.02479, 2019 | 91 | 2019 |
Effect of data reduction on sequence-to-sequence neural TTS J Latorre, J Lachowicz, J Lorenzo-Trueba, T Merritt, T Drugman, ... ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019 | 73 | 2019 |
Robust TTS duration modelling using DNNs GE Henter, S Ronanki, O Watts, M Wester, Z Wu, S King 2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016 | 48 | 2016 |
Transformer-transducers for code-switched speech recognition S Dalmia, Y Liu, S Ronanki, K Kirchhoff ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 45 | 2021 |
Learning interpretable control dimensions for speech synthesis by using external data Z Hodari, O Watts, S Ronanki, S King Interspeech 2018, 32-36, 2018 | 36 | 2018 |
Robust prediction of punctuation and truecasing for medical ASR M Sunkara, S Ronanki, K Dixit, S Bodapati, K Kirchhoff arXiv preprint arXiv:2007.02025, 2020 | 33 | 2020 |
In other news: A bi-style text-to-speech model for synthesizing newscaster voice with limited data N Prateek, M Łajszczak, R Barra-Chicote, T Drugman, J Lorenzo-Trueba, ... arXiv preprint arXiv:1904.02790, 2019 | 33 | 2019 |
Multimodal semi-supervised learning framework for punctuation prediction in conversational speech M Sunkara, S Ronanki, D Bekal, S Bodapati, K Kirchhoff arXiv preprint arXiv:2008.00702, 2020 | 32 | 2020 |
A Template-Based Approach for Speech Synthesis Intonation Generation Using LSTMs. S Ronanki, GE Henter, Z Wu, S King INTERSPEECH, 2463-2467, 2016 | 25 | 2016 |
Automatic Pronunciation Scoring And Mispronunciation Detection Using CMUSphinx R Srikanth, J Salsman Proceedings of the Workshop on Speech and Language Processing Tools in …, 2012 | 24 | 2012 |
Personalization of CTC speech recognition models S Dingliwal, M Sunkara, S Ronanki, J Farris, K Kirchhoff, S Bodapati 2022 IEEE Spoken Language Technology Workshop (SLT), 302-309, 2023 | 23 | 2023 |
Median-based generation of synthetic speech durations using a non-parametric approach S Ronanki, O Watts, S King, GE Henter 2016 IEEE Spoken Language Technology Workshop (SLT), 686-692, 2016 | 19 | 2016 |
Adapting long context NLM for ASR rescoring in conversational agents A Shenoy, S Bodapati, M Sunkara, S Ronanki, K Kirchhoff arXiv preprint arXiv:2104.11070, 2021 | 18 | 2021 |
The CSTR entry to the Blizzard Challenge 2016 T Merritt, S Ronanki, Z Wu, O Watts Blizzard Challenge 2016, 2016 | 18 | 2016 |
A Hierarchical Encoder-Decoder Model for Statistical Parametric Speech Synthesis. S Ronanki, O Watts, S King Interspeech, 1133-1137, 2017 | 16 | 2017 |
Text-to-speech (TTS) processing with transfer of vocal characteristics V Klimkov, TR Drugman, A Galkin, S Ronanki US Patent 11,410,684, 2022 | 14 | 2022 |
Text-to-speech (TTS) processing JL Trueba, TR Drugman, V Klimkov, S Ronanki, TE Merritt, AP Breen, ... US Patent 10,741,169, 2020 | 11 | 2020 |
DNN-based Speech Synthesis for Indian Languages from ASCII text S Ronanki, S Reddy, B Bollepalli, S King arXiv preprint arXiv:1608.05374, 2016 | 11 | 2016 |