Mastering the game of Go with deep neural networks and tree search D Silver, A Huang, CJ Maddison, A Guez, L Sifre, G Van Den Driessche, ... nature 529 (7587), 484-489, 2016 | 15304 | 2016 |
Continuous control with deep reinforcement learning TP Lillicrap, JJ Hunt, A Pritzel, N Heess, T Erez, Y Tassa, D Silver, ... ICLR 2016; arXiv preprint arXiv:1509.02971, 2015 | 11590 | 2015 |
Mastering the game of go without human knowledge D Silver, J Schrittwieser, K Simonyan, I Antonoglou, A Huang, A Guez, ... nature 550 (7676), 354-359, 2017 | 8580 | 2017 |
Asynchronous methods for deep reinforcement learning V Mnih, AP Badia, M Mirza, A Graves, TP Lillicrap, T Harley, D Silver, ... arXiv:1602.01783, 2016 | 8428 | 2016 |
Matching networks for one shot learning O Vinyals, C Blundell, T Lillicrap, K Kavukcuoglu, D Wierstra arXiv preprint arXiv:1606.04080, 2016 | 5743 | 2016 |
A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... Science 362 (6419), 1140-1144, 2018 | 3075 | 2018 |
Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019 | 2833 | 2019 |
Meta-learning with memory-augmented neural networks A Santoro, S Bartunov, M Botvinick, D Wierstra, T Lillicrap International conference on machine learning, 1842-1850, 2016 | 2376 | 2016 |
Mastering chess and shogi by self-play with a general reinforcement learning algorithm D Silver, T Hubert, J Schrittwieser, I Antonoglou, M Lai, A Guez, M Lanctot, ... arXiv preprint arXiv:1712.01815, 2017 | 1658 | 2017 |
Deep reinforcement learning for robotic manipulation S Gu, E Holly, T Lillicrap, S Levine arXiv:1610.00633, 2016 | 1572* | 2016 |
A simple neural network module for relational reasoning A Santoro, D Raposo, DG Barrett, M Malinowski, R Pascanu, P Battaglia, ... Advances in neural information processing systems 30, 2017 | 1544 | 2017 |
Mastering atari, go, chess and shogi by planning with a learned model J Schrittwieser, I Antonoglou, T Hubert, K Simonyan, L Sifre, S Schmitt, ... Nature 588 (7839), 604-609, 2020 | 1374 | 2020 |
Continuous deep Q-learning with model-based acceleration S Gu, T Lillicrap, I Sutskever, S Levine ICML2016; arXiv:1603.00748 [cs.LG], 2016 | 1064 | 2016 |
Learning latent dynamics for planning from pixels D Hafner, T Lillicrap, I Fischer, R Villegas, D Ha, H Lee, J Davidson International conference on machine learning, 2555-2565, 2019 | 974 | 2019 |
Why copy others? Insights from the social learning strategies tournament L Rendell, R Boyd, D Cownden, M Enquist, K Eriksson, MW Feldman, ... Science 328 (5975), 208-213, 2010 | 833 | 2010 |
Starcraft ii: A new challenge for reinforcement learning O Vinyals, T Ewalds, S Bartunov, P Georgiev, AS Vezhnevets, M Yeo, ... arXiv preprint arXiv:1708.04782, 2017 | 820 | 2017 |
Dream to control: Learning behaviors by latent imagination D Hafner, T Lillicrap, J Ba, M Norouzi arXiv preprint arXiv:1912.01603, 2019 | 691 | 2019 |
Random synaptic feedback weights support error backpropagation for deep learning TP Lillicrap, D Cownden, DB Tweed, CJ Akerman Nature communications 7 (1), 13276, 2016 | 678 | 2016 |
A deep learning framework for neuroscience BA Richards, TP Lillicrap, P Beaudoin, Y Bengio, R Bogacz, ... Nature neuroscience 22 (11), 1761-1770, 2019 | 591 | 2019 |
Vector-based navigation using grid-like representations in artificial agents A Banino, C Barry, B Uria, C Blundell, T Lillicrap, P Mirowski, A Pritzel, ... Nature 557 (7705), 429-433, 2018 | 552 | 2018 |