Recurrent neural network based language model. T Mikolov, M Karafiát, L Burget, J Černocký, S Khudanpur Interspeech 2 (3), 1045-1048, 2010 | 7903 | 2010 |
Librispeech: an ASR corpus based on public domain audio books V Panayotov, G Chen, D Povey, S Khudanpur 2015 IEEE international conference on acoustics, speech and signal …, 2015 | 6350 | 2015 |
X-vectors: Robust DNN embeddings for speaker recognition D Snyder, D Garcia-Romero, G Sell, D Povey, S Khudanpur 2018 IEEE international conference on acoustics, speech and signal …, 2018 | 3065 | 2018 |
Extensions of recurrent neural network language model T Mikolov, S Kombrink, L Burget, J Černocký, S Khudanpur 2011 IEEE international conference on acoustics, speech and signal …, 2011 | 1670 | 2011 |
Audio augmentation for speech recognition. T Ko, V Peddinti, D Povey, S Khudanpur Interspeech 2015, 3586, 2015 | 1348 | 2015 |
A time delay neural network architecture for efficient modeling of long temporal contexts. V Peddinti, D Povey, S Khudanpur Interspeech, 3214-3218, 2015 | 1316 | 2015 |
Deep neural network embeddings for text-independent speaker verification. D Snyder, D Garcia-Romero, D Povey, S Khudanpur Interspeech 2017, 999-1003, 2017 | 1043 | 2017 |
A study on data augmentation of reverberant speech for robust speech recognition T Ko, V Peddinti, D Povey, ML Seltzer, S Khudanpur 2017 IEEE international conference on acoustics, speech and signal …, 2017 | 1030 | 2017 |
Purely sequence-trained neural networks for ASR based on lattice-free MMI. D Povey, V Peddinti, D Galvez, P Ghahremani, V Manohar, X Na, Y Wang, ... Interspeech, 2751-2755, 2016 | 989 | 2016 |
Semi-orthogonal low-rank matrix factorization for deep neural networks. D Povey, G Cheng, Y Wang, K Li, H Xu, M Yarmohammadi, S Khudanpur Interspeech, 3743-3747, 2018 | 600 | 2018 |
Deep neural network-based speaker embeddings for end-to-end speaker verification D Snyder, P Ghahremani, D Povey, D Garcia-Romero, Y Carmiel, ... 2016 IEEE spoken language technology workshop (SLT), 165-170, 2016 | 435 | 2016 |
JHU-ISI gesture and skill assessment working set (JIGSAWS): A surgical activity dataset for human motion modeling Y Gao, SS Vedula, CE Reiley, N Ahmidi, B Varadarajan, HC Lin, L Tao, ... MICCAI workshop: M2CAI 3 (2014), 3, 2014 | 430 | 2014 |
Improving deep neural network acoustic models using generalized maxout networks X Zhang, J Trmal, D Povey, S Khudanpur 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 400 | 2014 |
Parallel training of DNNs with natural gradient and parameter averaging D Povey, X Zhang, S Khudanpur arXiv preprint arXiv:1410.7455, 2014 | 397 | 2014 |
A pitch extraction algorithm tuned for automatic speech recognition P Ghahremani, B BabaAli, D Povey, K Riedhammer, J Trmal, ... 2014 IEEE international conference on acoustics, speech and signal …, 2014 | 388 | 2014 |
Speaker recognition for multi-speaker conversations using x-vectors D Snyder, D Garcia-Romero, G Sell, A McCree, D Povey, S Khudanpur ICASSP 2019-2019 IEEE International conference on acoustics, speech and …, 2019 | 362 | 2019 |
Developments and directions in speech recognition and understanding, Part 1 [DSP Education] JM Baker, L Deng, J Glass, S Khudanpur, CH Lee, N Morgan, ... IEEE Signal processing magazine 26 (3), 75-80, 2009 | 362 | 2009 |
Highway long short-term memory RNNs for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016 | 361 | 2016 |
A smorgasbord of features for statistical machine translation FJ Och, D Gildea, S Khudanpur, A Sarkar, K Yamada, A Fraser, S Kumar, ... Proceedings of the Human Language Technology Conference of the North …, 2004 | 361 | 2004 |
CHiME-6 challenge: Tackling multispeaker speech recognition for unsegmented recordings S Watanabe, M Mandel, J Barker, E Vincent, A Arora, X Chang, ... arXiv preprint arXiv:2004.09249, 2020 | 303 | 2020 |