Taking the human out of the loop: A review of Bayesian optimization B Shahriari, K Swersky, Z Wang, RP Adams, N De Freitas Proceedings of the IEEE 104 (1), 148-175, 2015 | 5204 | 2015 |
Dueling network architectures for deep reinforcement learning Z Wang, T Schaul, M Hessel, H van Hasselt, M Lanctot, N de Freitas International conference on machine learning, 1995-2003, 2016 | 4870 | 2016 |
Grandmaster level in StarCraft II using multi-agent reinforcement learning O Vinyals, I Babuschkin, WM Czarnecki, M Mathieu, A Dudzik, J Chung, ... Nature 575 (7782), 350-354, 2019 | 4353 | 2019 |
Emergence of locomotion behaviours in rich environments N Heess, D TB, S Sriram, J Lemmon, J Merel, G Wayne, Y Tassa, T Erez, ... arXiv preprint arXiv:1707.02286, 2017 | 1093 | 2017 |
Sample efficient actor-critic with experience replay Z Wang, V Bapst, N Heess, V Mnih, R Munos, K Kavukcuoglu, ... arXiv preprint arXiv:1611.01224, 2016 | 984 | 2016 |
Bayesian optimization in a billion dimensions via random embeddings Z Wang, F Hutter, M Zoghi, D Matheson, N De Freitas Journal of Artificial Intelligence Research 55, 361-387, 2016 | 821 | 2016 |
AlphaStar: Mastering the real-time strategy game StarCraft II O Vinyals, I Babuschkin, J Chung, M Mathieu, M Jaderberg, ... DeepMind blog 2, 20, 2019 | 543 | 2019 |
Reinforcement and imitation learning for diverse visuomotor skills Y Zhu, Z Wang, J Merel, A Rusu, T Erez, S Cabi, S Tunyasuvunakool, ... arXiv preprint arXiv:1802.09564, 2018 | 354 | 2018 |
Autonomous navigation of stratospheric balloons using reinforcement learning MG Bellemare, S Candido, PS Castro, J Gong, MC Machado, S Moitra, ... Nature 588 (7836), 77-82, 2020 | 350 | 2020 |
Deep fried convnets Z Yang, M Moczulski, M Denil, N De Freitas, A Smola, L Song, Z Wang Proceedings of the IEEE international conference on computer vision, 1476-1483, 2015 | 345 | 2015 |
Learning an embedding space for transferable robot skills K Hausman, JT Springenberg, Z Wang, N Heess, M Riedmiller International Conference on Learning Representations, 2018 | 339 | 2018 |
Playing hard exploration games by watching YouTube Y Aytar, T Pfaff, D Budden, T Paine, Z Wang, N De Freitas Advances in neural information processing systems 31, 2018 | 301 | 2018 |
Critic regularized regression Z Wang, A Novikov, K Zolna, JS Merel, JT Springenberg, SE Reed, ... Advances in Neural Information Processing Systems 33, 7768-7778, 2020 | 299 | 2020 |
Robust imitation of diverse behaviors Z Wang, JS Merel, SE Reed, N de Freitas, G Wayne, N Heess Advances in Neural Information Processing Systems 30, 2017 | 240 | 2017 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 238 | 2020 |
Parallel multiscale autoregressive density estimation S Reed, A Oord, N Kalchbrenner, SG Colmenarejo, Z Wang, Y Chen, ... International conference on machine learning, 2912-2921, 2017 | 236 | 2017 |
Learning human behaviors from motion capture by adversarial imitation J Merel, Y Tassa, D TB, S Srinivasan, J Lemmon, Z Wang, G Wayne, ... arXiv preprint arXiv:1707.02201, 2017 | 229 | 2017 |
RL Unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020 | 163 | 2020 |
Adaptive Hamiltonian and Riemann manifold Monte Carlo Z Wang, S Mohamed, N Freitas International conference on machine learning, 1462-1470, 2013 | 155 | 2013 |
Bayesian optimization in AlphaGo Y Chen, A Huang, Z Wang, I Antonoglou, J Schrittwieser, D Silver, ... arXiv preprint arXiv:1812.06855, 2018 | 152 | 2018 |