Title | Authors | Venue | Cited by | Year |
Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX | C Bonnet, D Luo, D Byrne, S Surana, V Coyette, P Duckworth, LI Midgley, ... | arXiv preprint arXiv:2306.09884, 2023 | 13* | 2023 |
Winner Takes It All: Training Performant RL Populations for Combinatorial Optimization | N Grinsztajn, D Furelos-Blanco, S Surana, C Bonnet, TD Barrett | Thirty-seventh Conference on Neural Information Processing Systems, 2023 | 12 | 2023 |
Combinatorial Optimization with Policy Adaptation using Latent Space Search | F Chalumeau, S Surana, C Bonnet, N Grinsztajn, A Pretorius, A Laterre, ... | arXiv preprint arXiv:2311.13569, 2023 | 10 | 2023 |
One Step at a Time: Pros and Cons of Multi-Step Meta-Gradient Reinforcement Learning | C Bonnet, P Caron, T Barrett, I Davies, A Laterre | 5th Workshop on Meta-Learning at NeurIPS 2021, 2021 | 3 | 2021 |
Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function | C Bonnet, L Midgley, A Laterre | 6th Workshop on Meta-Learning at NeurIPS 2022, 2022 | 1 | 2022 |