Speech to semantics: Improve ASR and NLU jointly via all-neural interfaces M Rao, A Raju, P Dheram, B Bui, A Rastrow arXiv preprint arXiv:2008.06173, 2020 | 47 | 2020 |
Toward fairness in speech recognition: Discovery and mitigation of performance disparities P Dheram, M Ramakrishnan, A Raju, IF Chen, B King, K Powell, ... arXiv preprint arXiv:2207.11345, 2022 | 37 | 2022 |
Do as I mean, not as I say: Sequence loss training for spoken language understanding M Rao, P Dheram, G Tiwari, A Raju, J Droppo, A Rastrow, A Stolcke ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021 | 20 | 2021 |
On joint training with interfaces for spoken language understanding A Raju, M Rao, G Tiwari, P Dheram, B Anderson, Z Zhang, C Lee, B Bui, ... arXiv preprint arXiv:2106.15919, 2021 | 11 | 2021 |
End-to-end spoken language understanding using RNN-transducer ASR A Raju, G Tiwari, M Rao, P Dheram, B Anderson, Z Zhang, B Bui, ... arXiv preprint arXiv:2106.15919, 2021 | 8 | 2021 |
Turn-taking and backchannel prediction with acoustic and large language model fusion J Wang, L Chen, A Khare, A Raju, P Dheram, D He, M Wu, A Stolcke, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 6 | 2024 |
Hot-fixing wake word recognition for end-to-end ASR via neural model reprogramming PJ Ku, IF Chen, CHH Yang, A Raju, P Dheram, P Ghahremani, B King, ... ICASSP 2024-2024 IEEE International Conference on Acoustics, Speech and …, 2024 | 1 | 2024 |
Multi-stage multi-modal pre-training for automatic speech recognition Y Jain, D Chan, P Dheram, A Khare, O Shonibare, V Ravichandran, ... arXiv preprint arXiv:2403.19822, 2024 | 1 | 2024 |