Learning Dynamics Model in Reinforcement Learning by Incorporating the Long Term Future NR Ke, A Singh, A Touati, A Goyal, Y Bengio, D Parikh, D Batra ICLR 2019 - Proceedings of the Seventh International Conference on Learning …, 2019 | 88* | 2019 |
Learning One Representation to Optimize All Rewards A Touati, Y Ollivier NeurIPS 2021: Thirty-fifth Conference on Neural Information Processing Systems, 2021 | 56 | 2021 |
Convergent Tree Backup and Retrace with Function Approximation A Touati, PL Bacon, D Precup, P Vincent ICML 2018, Proceedings of the 35th International Conference on Machine …, 2017 | 49 | 2017 |
Randomized value functions via multiplicative normalizing flows A Touati, H Satija, J Romoff, J Pineau, P Vincent UAI2019: Conference on Uncertainty in Artificial Intelligence 2019, 2018 | 43 | 2018 |
Efficient learning in non-stationary linear markov decision processes A Touati, P Vincent arXiv preprint arXiv:2010.12870, 2020 | 35 | 2020 |
Learnable explicit density for continuous latent space and variational inference CW Huang, A Touati, L Dinh, M Drozdzal, M Havaei, L Charlin, ... ICML 2017 Workshop on Principle Approaches to Deep Learning (padl), 2017 | 31 | 2017 |
Real-time privacy-preserving model-based estimation of traffic flows J Le Ny, A Touati, GJ Pappas 2014 ACM/IEEE International Conference on Cyber-Physical Systems (ICCPS), 92-102, 2014 | 31 | 2014 |
Separable value functions across time-scales J Romoff, P Henderson, A Touati, Y Ollivier, J Pineau, E Brunskill ICML 2019, Proceedings of the 36th International Conference on Machine …, 2019 | 26* | 2019 |
Does Zero-Shot Reinforcement Learning Exist? A Touati, J Rapin, Y Ollivier ICLR 2023, 2022 | 21 | 2022 |
Parametric adversarial divergences are good task losses for generative modeling G Huang, H Berard, A Touati, G Gidel, P Vincent, S Lacoste-Julien MAIS18, Montreal AI Symposium 2018, 2017 | 18* | 2017 |
Stable Policy Optimization via Off-Policy Divergence Regularization A Touati, A Zhang, J Pineau, P Vincent UAI2020: Conference on Uncertainty in Artificial Intelligence 2020, 2020 | 17 | 2020 |
Zooming for efficient model-free reinforcement learning in metric spaces A Touati, AA Taiga, MG Bellemare arXiv preprint arXiv:2003.04069, 2020 | 16 | 2020 |
Maximum reward formulation in reinforcement learning SK Gottipati, Y Pathak, R Nuttall, R Chunduru, A Touati, SG Subramanian, ... arXiv preprint arXiv:2010.03744, 2020 | 12 | 2020 |
TDprop: Does Adaptive Optimization With Jacobi Preconditioning Help Temporal Difference Learning? J Romoff, P Henderson, D Kanaa, E Bengio, A Touati, PL Bacon, J Pineau Proceedings of the 20th International Conference on Autonomous Agents and …, 2021 | 10* | 2021 |
SVRG for policy evaluation with fewer gradient evaluations Z Peng, A Touati, P Vincent, D Precup IJCAI2020: the 29th International Joint Conference on Artificial Intelligence, 2019 | 10 | 2019 |
Stochastic Neural Network with Kronecker Flow CW Huang, A Touati, P Vincent, GK Dziugaite, A Lacoste, A Courville AISTATS 2020 - Proceedings of the 23nd International Conference on …, 2019 | 9 | 2019 |
Fast imitation via behavior foundation models M Pirotta, A Tirinzoni, A Touati, A Lazaric, Y Ollivier NeurIPS 2023 Foundation Models for Decision Making Workshop, 2023 | 3 | 2023 |
Scalable Representation Learning in Linear Contextual Bandits with Constant Regret Guarantees A Tirinzoni, M Papini, A Touati, A Lazaric, M Pirotta NeurIPS 2022, 2022 | 3 | 2022 |
Score models for offline goal-conditioned reinforcement learning H Sikchi, R Chitnis, A Touati, A Geramifard, A Zhang, S Niekum arXiv preprint arXiv:2311.02013, 2023 | 2 | 2023 |
Adversarial divergences are good task losses for generative modeling G Huang, G Gidel, H Berard, A Touati, S Lacoste-Julien arXiv preprint arXiv:1708.02511, 2017 | 2 | 2017 |