Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017 | 1736 | 2017 |
Deep Q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018 | 1404* | 2018 |
A unified game-theoretic approach to multiagent reinforcement learning M Lanctot, V Zambaldi, A Gruslys, A Lazaridou, K Tuyls, J Pérolat, D Silver, ... Advances in neural information processing systems 30, 2017 | 737 | 2017 |
Comparing, optimizing, and benchmarking quantum-control algorithms in a unifying programming framework S Machnes, U Sander, SJ Glaser, P de Fouquieres, A Gruslys, S Schirmer, ... Physical Review A—Atomic, Molecular, and Optical Physics 84 (2), 022305, 2011 | 291 | 2011 |
Memory-efficient backpropagation through time A Gruslys, R Munos, I Danihelka, M Lanctot, A Graves Advances in neural information processing systems 29, 2016 | 248 | 2016 |
Three-dimensional digital template atlas of the macaque brain C Reveley, A Gruslys, FQ Ye, D Glen, J Samaha, BE Russ, Z Saad, ... Cerebral cortex 27 (9), 4463-4477, 2017 | 187 | 2017 |
Mastering the game of Stratego with model-free multiagent reinforcement learning J Perolat, B De Vylder, D Hennes, E Tarassov, F Strub, V de Boer, ... Science 378 (6623), 990-996, 2022 | 178 | 2022 |
The Reactor: A fast and sample-efficient actor-critic agent for reinforcement learning A Gruslys, W Dabney, MG Azar, B Piot, M Bellemare, R Munos arXiv preprint arXiv:1704.04651, 2017 | 170* | 2017 |
Neural replicator dynamics: Multiagent learning via hedging policy gradients D Hennes, D Morrill, S Omidshafiei, R Munos, J Perolat, M Lanctot, ... Proceedings of the 19th international conference on autonomous agents and …, 2020 | 86* | 2020 |
Psychlab: a psychology laboratory for deep reinforcement learning agents JZ Leibo, CM d'Autume, D Zoran, D Amos, C Beattie, K Anderson, ... arXiv preprint arXiv:1801.08116, 2018 | 58 | 2018 |
Navigating the landscape of multiplayer games S Omidshafiei, K Tuyls, WM Czarnecki, FC Santos, M Rowland, J Connor, ... Nature communications 11 (1), 5603, 2020 | 47 | 2020 |
The advantage regret-matching actor-critic A Gruslys, M Lanctot, R Munos, F Timbers, M Schmid, J Perolat, D Morrill, ... arXiv preprint arXiv:2008.12234, 2020 | 26 | 2020 |
A new fast accurate nonlinear medical image registration program including surface preserving regularization A Gruslys, J Acosta-Cabronero, PJ Nestor, GB Williams, RE Ansorge IEEE transactions on medical imaging 33 (11), 2118-2127, 2014 | 22 | 2014 |
3000 non-rigid medical image registrations overnight on a single PC A Gruslys, S Sawiak, R Ansorge 2011 IEEE Nuclear Science Symposium Conference Record, 3073-3080, 2011 | 12 | 2011 |
Fast computation of Nash equilibria in imperfect information games R Munos, J Perolat, JB Lespiau, M Rowland, B De Vylder, M Lanctot, ... International Conference on Machine Learning, 7119-7129, 2020 | 11 | 2020 |
Three-dimensional digital template atlas of the macaque brain C Reveley, A Gruslys, FQ Ye, D Glen, J Samaha, BE Russ, Z Saad, AK Seth, DA Leopold, KS Saleem Cereb. Cortex …, 2017 | 11 | 2017 |
Training action selection neural networks using leave-one-out-updates M Gendron-Bellemare, MG Azar, A Gruslys, R Munos US Patent 11,604,997, 2023 | 9 | 2023 |
Quantile credit assignment T Mesnard, W Chen, A Saade, Y Tang, M Rowland, T Weber, C Lyle, ... International Conference on Machine Learning, 24517-24531, 2023 | 3 | 2023 |
Navigating the Landscape of Multiplayer Games to Probe the Drosophila of AI S Omidshafiei, K Tuyls, WM Czarnecki, FC Santos, M Rowland, J Connor, ... arXiv e-prints, arXiv: 2005.01642, 2020 | | 2020 |
Development and applications of GPU based medical image registration A Gruslys University of Cambridge, 2014 | | 2014 |