Recent advances in deep learning for speech research at Microsoft L Deng, J Li, JT Huang, K Yao, D Yu, F Seide, M Seltzer, G Zweig, X He, ... 2013 IEEE international conference on acoustics, speech and signal …, 2013 | 1030 | 2013 |
Gemini: a family of highly capable multimodal models G Team, R Anil, S Borgeaud, Y Wu, JB Alayrac, J Yu, R Soricut, ... arXiv preprint arXiv:2312.11805, 2023 | 947 | 2023 |
Using recurrent neural networks for slot filling in spoken language understanding G Mesnil, Y Dauphin, K Yao, Y Bengio, L Deng, D Hakkani-Tur, X He, ... IEEE/ACM Transactions on Audio, Speech, and Language Processing 23 (3), 530-539, 2014 | 758 | 2014 |
KL-divergence regularized deep neural network adaptation for improved large vocabulary speech recognition D Yu, K Yao, H Su, G Li, F Seide 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 522 | 2013 |
An introduction to computational networks and the computational network toolkit D Yu, A Eversole, M Seltzer, K Yao, Z Huang, B Guenter, O Kuchaiev, ... Microsoft Technical Report MSR-TR-2014–112, 2014 | 471 | 2014 |
Recurrent neural networks for language understanding. K Yao, G Zweig, MY Hwang, Y Shi, D Yu Interspeech, 2524-2528, 2013 | 411 | 2013 |
Spoken language understanding using long short-term memory neural networks K Yao, B Peng, Y Zhang, D Yu, G Zweig, Y Shi 2014 IEEE spoken language technology workshop (SLT), 189-194, 2014 | 402 | 2014 |
Highway long short-term memory rnns for distant speech recognition Y Zhang, G Chen, D Yu, K Yao, S Khudanpur, J Glass 2016 IEEE international conference on acoustics, speech and signal …, 2016 | 361 | 2016 |
Assignment of semantic labels to a sequence of words using neural network architectures A Deoras, K Yao, X He, L Deng, GG Zweig, R Sarikaya, D Yu, MY Hwang, ... US Patent 10,867,597, 2020 | 283 | 2020 |
Adaptation of context-dependent deep neural networks for automatic speech recognition K Yao, D Yu, F Seide, H Su, L Deng, Y Gong 2012 IEEE Spoken Language Technology Workshop (SLT), 366-369, 2012 | 256 | 2012 |
Incorporating structural alignment biases into an attentional neural translation model T Cohn, CDV Hoang, E Vymolova, K Yao, C Dyer, G Haffari arXiv preprint arXiv:1601.01085, 2016 | 199 | 2016 |
Sequence-to-sequence neural net models for grapheme-to-phoneme conversion K Yao, G Zweig arXiv preprint arXiv:1506.00196, 2015 | 197 | 2015 |
System and method for text-to-phoneme mapping with prior knowledge K Yao US Patent App. 11/278,497, 2007 | 170 | 2007 |
Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context M Reid, N Savinov, D Teplyashin, D Lepikhin, T Lillicrap, J Alayrac, ... arXiv preprint arXiv:2403.05530, 2024 | 166 | 2024 |
Recurrent conditional random field for language understanding K Yao, B Peng, G Zweig, D Yu, X Li, F Gao 2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014 | 162 | 2014 |
Attention with intention for a neural network conversation model K Yao, G Zweig, B Peng arXiv preprint arXiv:1510.08565, 2015 | 145 | 2015 |
Hyper-structure recurrent neural networks for text-to-speech P Zhao, M Leung, K Yao, B Yan, S Zhao, FA Alleva US Patent 10,127,901, 2018 | 139 | 2018 |
Depth-gated LSTM K Yao, T Cohn, K Vylomova, K Duh, C Dyer arXiv preprint arXiv:1508.03790, 2015 | 123 | 2015 |
Depth-gated recurrent neural networks K Yao, T Cohn, K Vylomova, K Duh, C Dyer arXiv preprint arXiv:1508.03790 9, 98, 2015 | 110 | 2015 |
Conservatively adapting a deep neural network in a recognition system D Yu, K Yao, H Su, G Li, F Seide US Patent 9,177,550, 2015 | 96 | 2015 |