关注
Kanishka Rao
Kanishka Rao
Software Engineer, Google
在 google.com 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Federated learning for mobile keyboard prediction
A Hard, K Rao, R Mathews, S Ramaswamy, F Beaufays, S Augenstein, ...
arXiv preprint arXiv:1811.03604, 2018
15872018
State-of-the-art speech recognition with sequence-to-sequence models
CC Chiu, TN Sainath, Y Wu, R Prabhavalkar, P Nguyen, Z Chen, ...
2018 IEEE international conference on acoustics, speech and signal …, 2018
14242018
Do as i can, not as i say: Grounding language in robotic affordances
M Ahn, A Brohan, N Brown, Y Chebotar, O Cortes, B David, C Finn, C Fu, ...
arXiv preprint arXiv:2204.01691, 2022
10172022
Streaming end-to-end speech recognition for mobile devices
Y He, TN Sainath, R Prabhavalkar, I McGraw, R Alvarez, D Zhao, ...
ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and …, 2019
7052019
Fast and accurate recurrent neural network acoustic models for speech recognition
H Sak, A Senior, K Rao, F Beaufays
Interspeech 2015, 2015
5492015
Rt-1: Robotics transformer for real-world control at scale
A Brohan, N Brown, J Carbajal, Y Chebotar, J Dabis, C Finn, ...
arXiv preprint arXiv:2212.06817, 2022
5152022
Exploring architectures, data and units for streaming end-to-end speech recognition with rnn-transducer
K Rao, H Sak, R Prabhavalkar
2017 IEEE automatic speech recognition and understanding workshop (ASRU …, 2017
4002017
Rt-2: Vision-language-action models transfer web knowledge to robotic control
A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ...
arXiv preprint arXiv:2307.15818, 2023
3802023
A Comparison of sequence-to-sequence models for speech recognition.
R Prabhavalkar, K Rao, TN Sainath, B Li, L Johnson, N Jaitly
Interspeech, 939-943, 2017
3792017
Federated learning for emoji prediction in a mobile keyboard
S Ramaswamy, R Mathews, K Rao, F Beaufays
arXiv preprint arXiv:1906.04329, 2019
3272019
Do as i can, not as i say: Grounding language in robotic affordances
A Brohan, Y Chebotar, C Finn, K Hausman, A Herzog, D Ho, J Ibarz, ...
Conference on robot learning, 287-318, 2023
2872023
Multilingual speech recognition with a single end-to-end model
S Toshniwal, TN Sainath, RJ Weiss, B Li, P Moreno, E Weinstein, K Rao
2018 IEEE international conference on acoustics, speech and signal …, 2018
2782018
Grapheme-to-phoneme conversion using long short-term memory recurrent neural networks
K Rao, F Peng, H Sak, F Beaufays
2015 IEEE International Conference on Acoustics, Speech and Signal …, 2015
2732015
Learning acoustic frame labeling for speech recognition with recurrent neural networks
H Sak, A Senior, K Rao, O Irsoy, A Graves, F Beaufays, J Schalkwyk
2015 IEEE international conference on acoustics, speech and signal …, 2015
2262015
Personalized speech recognition on mobile devices
I McGraw, R Prabhavalkar, R Alvarez, MG Arenas, K Rao, D Rybach, ...
2016 IEEE International Conference on Acoustics, Speech and Signal …, 2016
2202016
Lingvo: a modular and scalable framework for sequence-to-sequence modeling
J Shen, P Nguyen, Y Wu, Z Chen, MX Chen, Y Jia, A Kannan, T Sainath, ...
arXiv preprint arXiv:1902.08295, 2019
2022019
Large-scale visual speech recognition
B Shillingford, Y Assael, MW Hoffman, T Paine, C Hughes, U Prabhu, ...
arXiv preprint arXiv:1807.05162, 2018
1912018
Rl-cyclegan: Reinforcement learning aware simulation-to-real
K Rao, C Harris, A Irpan, S Levine, J Ibarz, M Khansari
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
1792020
Acoustic modelling with cd-ctc-smbr lstm rnns
A Senior, H Sak, F de Chaumont Quitry, T Sainath, K Rao
2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU …, 2015
1562015
Recurrent neural aligner: An encoder-decoder neural network model for sequence to sequence mapping.
H Sak, M Shannon, K Rao, F Beaufays
Interspeech 8, 1298-1302, 2017
1472017
系统目前无法执行此操作,请稍后再试。
文章 1–20