Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017 | 1424* | 2017 |
Deep reinforcement learning in large discrete action spaces G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ... arXiv preprint arXiv:1512.07679, 2015 | 729 | 2015 |
Challenges of real-world reinforcement learning G Dulac-Arnold, D Mankowitz, T Hester ICML Workshop on Real-Life RL, 2019 | 665 | 2019 |
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ... Machine Learning 110 (9), 2419-2468, 2021 | 571* | 2021 |
The predictron: End-to-end learning and planning D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ... International Conference on Machine Learning, 3191-3199, 2017 | 306 | 2017 |
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020 | 249 | 2020 |
Rl unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020 | 197* | 2020 |
Model-based offline planning A Argenson, G Dulac-Arnold ICLR 2021, 2020 | 144 | 2020 |
Learning to run a power network challenge: a retrospective analysis A Marot, B Donnot, G Dulac-Arnold, A Kelly, A O’Sullivan, J Viebahn, ... NeurIPS 2020 Competition and Demonstration Track, 112-132, 2021 | 71 | 2021 |
Datum-wise classification: a sequential approach to sparsity G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Machine Learning and Knowledge Discovery in Databases: European Conference …, 2011 | 69 | 2011 |
Deep reinforcement learning with attention for slate markov decision processes with high-dimensional states and actions P Sunehag, R Evans, G Dulac-Arnold, Y Zwols, D Visentin, B Coppin arXiv preprint arXiv:1512.01124, 2015 | 58 | 2015 |
AI-based mobile application to fight antibiotic resistance M Pascucci, G Royer, J Adamek, MA Asmar, D Aristizabal, L Blanche, ... Nature communications 12 (1), 1173, 2021 | 57 | 2021 |
Differentiable deep clustering with cluster size constraints A Genevay, G Dulac-Arnold, JP Vert arXiv preprint arXiv:1910.09036, 2019 | 45 | 2019 |
Text classification: A sequential reading approach G Dulac-Arnold, L Denoyer, P Gallinari European Conference on Information Retrieval, 411-423, 2011 | 40 | 2011 |
Deep multi-class learning from label proportions G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert arXiv preprint arXiv:1905.12909, 2019 | 38 | 2019 |
Barkour: Benchmarking animal-level agility with quadruped robots K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ... arXiv preprint arXiv:2305.14654, 2023 | 28 | 2023 |
Fast reinforcement learning with large action sets using error-correcting output codes for mdp factorization G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Joint European Conference on Machine Learning and Knowledge Discovery in …, 2012 | 28 | 2012 |
Learning dynamics models for model predictive agents M Lutter, L Hasenclever, A Byravan, G Dulac-Arnold, P Trochim, N Heess, ... arXiv preprint arXiv:2109.14311, 2021 | 24 | 2021 |
Sequential approaches for learning datum-wise sparse representations G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Machine learning 89, 87-122, 2012 | 24 | 2012 |
Residual reinforcement learning from demonstrations M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid arXiv preprint arXiv:2106.08050, 2021 | 22 | 2021 |