Model-Agnostic Meta-Learning for Fast Adaptation of Deep Networks C Finn, P Abbeel, S Levine International Conference on Machine Learning (ICML), 2017 | 12667 | 2017 |
Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor T Haarnoja, A Zhou, P Abbeel, S Levine International conference on machine learning, 1861-1870, 2018 | 8367 | 2018 |
Trust region policy optimization J Schulman, S Levine, P Moritz, MI Jordan, P Abbeel International Conference on Machine Learning (ICML 2015), 2015 | 8195 | 2015 |
End-to-end training of deep visuomotor policies S Levine, C Finn, T Darrell, P Abbeel Journal of Machine Learning Research 17 (39), 1-40, 2016 | 3918 | 2016 |
High-dimensional continuous control using generalized advantage estimation J Schulman, P Moritz, S Levine, M Jordan, P Abbeel International Conference on Learning Representations (ICLR 2016), 2015 | 3627 | 2015 |
Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection S Levine, P Pastor, A Krizhevsky, J Ibarz, D Quillen The International Journal of Robotics Research 37 (4-5), 421-436, 2018 | 2646 | 2018 |
Soft actor-critic algorithms and applications T Haarnoja, A Zhou, K Hartikainen, G Tucker, S Ha, J Tan, V Kumar, ... arXiv preprint arXiv:1812.05905, 2018 | 2564 | 2018 |
Deep reinforcement learning for robotic manipulation with asynchronous off-policy updates S Gu, E Holly, T Lillicrap, S Levine 2017 IEEE international conference on robotics and automation (ICRA), 3389-3396, 2017 | 1858 | 2017 |
Offline reinforcement learning: Tutorial, review, and perspectives on open problems S Levine, A Kumar, G Tucker, J Fu arXiv preprint arXiv:2005.01643, 2020 | 1720 | 2020 |
Conservative q-learning for offline reinforcement learning A Kumar, A Zhou, G Tucker, S Levine Advances in Neural Information Processing Systems 33, 1179-1191, 2020 | 1601 | 2020 |
Scalable deep reinforcement learning for vision-based robotic manipulation D Kalashnikov, A Irpan, P Pastor, J Ibarz, A Herzog, E Jang, D Quillen, ... Conference on robot learning, 651-673, 2018 | 1512 | 2018 |
Reinforcement learning with deep energy-based policies T Haarnoja, H Tang, P Abbeel, S Levine International conference on machine learning, 1352-1361, 2017 | 1406 | 2017 |
Deep reinforcement learning in a handful of trials using probabilistic dynamics models K Chua, R Calandra, R McAllister, S Levine Advances in neural information processing systems 31, 2018 | 1384 | 2018 |
Guided Policy Search S Levine, V Koltun International Conference on Machine Learning (ICML 2013), 2013 | 1324 | 2013 |
Wilds: A benchmark of in-the-wild distribution shifts PW Koh, S Sagawa, H Marklund, SM Xie, M Zhang, A Balsubramani, ... International conference on machine learning, 5637-5664, 2021 | 1250 | 2021 |
Continuous deep q-learning with model-based acceleration S Gu, T Lillicrap, I Sutskever, S Levine International conference on machine learning, 2829-2838, 2016 | 1243 | 2016 |
Unsupervised learning for physical interaction through video prediction C Finn, I Goodfellow, S Levine Advances in neural information processing systems 29, 2016 | 1194 | 2016 |
Neural network dynamics for model-based deep reinforcement learning with model-free fine-tuning A Nagabandi, G Kahn, RS Fearing, S Levine 2018 IEEE international conference on robotics and automation (ICRA), 7559-7566, 2018 | 1178 | 2018 |
Guided cost learning: Deep inverse optimal control via policy optimization C Finn, S Levine, P Abbeel International conference on machine learning, 49-58, 2016 | 1114 | 2016 |
Recurrent network models for human dynamics K Fragkiadaki, S Levine, P Felsen, J Malik Proceedings of the IEEE international conference on computer vision, 4346-4354, 2015 | 1108 | 2015 |