Jasper: An end-to-end convolutional neural acoustic model J Li, V Lavrukhin, B Ginsburg, R Leary, O Kuchaiev, JM Cohen, H Nguyen, ... arXiv preprint arXiv:1904.03288, 2019 | 293 | 2019 |
Training neural speech recognition systems with synthetic speech augmentation J Li, R Gadde, B Ginsburg, V Lavrukhin arXiv preprint arXiv:1811.00707, 2018 | 73 | 2018 |
Domain Prompts: Towards memory and compute efficient domain adaptation of ASR systems S Dingliwal, A Shenoy, S Bodapati, A Gandhe, RT Gadde, K Kirchhoff arXiv preprint arXiv:2112.08718, 2021 | 21 | 2021 |
Diff2Lip: Audio Conditioned Diffusion Models for Lip-Synchronization S Mukhopadhyay, S Suri, RT Gadde, A Shrivastava arXiv preprint arXiv:2308.09716, 2023 | 12 | 2023 |
Prompt-tuning in ASR systems for efficient domain-adaptation S Dingliwal, A Shenoy, S Bodapati, A Gandhe, RT Gadde, K Kirchhoff arXiv preprint arXiv:2110.06502, 2021 | 5 | 2021 |
SIDGAN: High-Resolution Dubbed Video Generation via Shift-Invariant Learning U Muaz, W Jang, R Tripathi, S Mani, W Ouyang, RT Gadde, B Gecer, ... Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 4 | 2023 |
Neural composition: Learning to generate from multiple models D Filimonov, RT Gadde, A Rastrow arXiv preprint arXiv:2007.16013, 2020 | 4 | 2020 |
Towards Continual Entity Learning in Language Models for Conversational Agents RT Gadde, I Bulyko arXiv preprint arXiv:2108.00082, 2021 | 2 | 2021 |
Entity language models for speech processing D Filimonov, RT Gadde, A Rastrow US Patent 11,688,394, 2023 | 1 | 2023 |
RefTextLAS: Reference Text Biased Listen, Attend, and Spell Model For Accurate Reading Evaluation PS Nidadavolu, N Xu, N Jutila, RT Gadde, AA Dara, J Savold, S Patel, ... | | |