Going deeper with convolutions C Szegedy, W Liu, Y Jia, P Sermanet, S Reed, D Anguelov, D Erhan, ... 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 1-9, 2015 | 61094 | 2015 |
Rethinking the Inception architecture for computer vision C Szegedy, V Vanhoucke, S Ioffe, J Shlens, Z Wojna 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2818 …, 2016 | 34047 | 2016 |
TensorFlow: Large-scale machine learning on heterogeneous distributed systems M Abadi, A Agarwal, P Barham, E Brevdo, Z Chen, C Citro, GS Corrado, ... arXiv preprint arXiv:1603.04467, 2016 | 31327* | 2016 |
Inception-v4, Inception-ResNet and the impact of residual connections on learning C Szegedy, S Ioffe, V Vanhoucke, A Alemi Proceedings of the AAAI conference on artificial intelligence 31 (1), 2017 | 17269 | 2017 |
Deep neural networks for acoustic modeling in speech recognition: The shared views of four research groups G Hinton, L Deng, D Yu, GE Dahl, A Mohamed, N Jaitly, A Senior, ... IEEE Signal processing magazine 29 (6), 82-97, 2012 | 13647 | 2012 |
Scalable deep reinforcement learning for vision-based robotic manipulation D Kalashnikov, A Irpan, P Pastor, J Ibarz, A Herzog, E Jang, D Quillen, ... 2nd Conference on Robot Learning (CoRL), 651-673, 2018 | 1511 | 2018 |
Do as I can, not as I say: Grounding language in robotic affordances M Ahn, A Brohan, N Brown, Y Chebotar, O Cortes, B David, C Finn, ... arXiv preprint arXiv:2204.01691, 2022 | 1266* | 2022 |
Improving the speed of neural networks on CPUs V Vanhoucke, A Senior, MZ Mao 2011 NIPS Deep Learning and Unsupervised Feature Learning Workshop, 2011 | 1055 | 2011 |
PaLM-E: An embodied multimodal language model D Driess, F Xia, MSM Sajjadi, C Lynch, A Chowdhery, B Ichter, A Wahid, ... arXiv preprint arXiv:2303.03378, 2023 | 980 | 2023 |
Sim-to-real: Learning agile locomotion for quadruped robots J Tan, T Zhang, E Coumans, A Iscen, Y Bai, D Hafner, S Bohez, ... Robotics: Science and Systems (RSS) XIV, 2018 | 831 | 2018 |
System and method for enabling the use of captured images through recognition US Patent 20,060,251,339 | 761* | |
On rectified linear units for speech processing MD Zeiler, M Ranzato, R Monga, M Mao, K Yang, QV Le, P Nguyen, ... 2013 IEEE International Conference on Acoustics, Speech and Signal …, 2013 | 737 | 2013 |
Using simulation and domain adaptation to improve efficiency of deep robotic grasping K Bousmalis, A Irpan, P Wohlhart, Y Bai, M Kelcey, M Kalakrishnan, ... 2018 IEEE International Conference on Robotics and Automation (ICRA), 4243-4250, 2018 | 727 | 2018 |
YouTube-BoundingBoxes: A large high-precision human-annotated data set for object detection in video E Real, J Shlens, S Mazzocchi, X Pan, V Vanhoucke IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 5296-5305, 2017 | 672 | 2017 |
System and method for enabling search and retrieval from image files based on recognized information SB Gokturk, D Anguelov, V Vanhoucke, K Lee, D Vu, D Yang, M Shah, ... US Patent 7,809,722, 2010 | 532 | 2010 |
RT-1: Robotics transformer for real-world control at scale A Brohan, N Brown, J Carbajal, Y Chebotar, J Dabis, C Finn, ... arXiv preprint arXiv:2212.06817, 2022 | 504 | 2022 |
System and method for providing objectified image renderings using recognition information from images SB Gokturk, D Anguelov, V Vanhoucke, K Lee, D Vu, D Yang, M Shah, ... US Patent 7,783,135, 2010 | 497* | 2010 |
System and method for recognizing objects from images and identifying relevancy amongst images and information SB Gokturk, D Anguelov, V Vanhoucke, K Lee, D Vu, D Yang, M Shah, ... US Patent 7,809,192, 2010 | 483 | 2010 |
System and method for enabling image recognition and searching of images SB Gokturk, B Sumengen, D Vu, N Dalal, D Yang, X Lin, A Khan, M Shah, ... US Patent 7,657,100, 2010 | 456 | 2010 |
RT-2: Vision-language-action models transfer web knowledge to robotic control A Brohan, N Brown, J Carbajal, Y Chebotar, X Chen, K Choromanski, ... arXiv preprint arXiv:2307.15818, 2023 | 434* | 2023 |