Gabriel Dulac-Arnold 个人学术档案

引用次数

	总计	2019 年至今
引用	4908	4534
h 指数	20	19
i10 指数	26	24

1100

550

275

825

201520162017201820192020202120222023202414 22 74 212 306 590 807 964 1100 761

开放获取的出版物数量

查看全部

2 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Todd HesterWaymo在 waymo.com 的电子邮件经过验证
Daniel J. MankowitzGoogle Deepmind在 google.com 的电子邮件经过验证
Olivier PietquinCohere | ex Google DeepMind (On leave - Professor at University of Lille)在 univ-lille.fr 的电子邮件经过验证
Tom SchaulSenior Staff Scientist, DeepMind在 nyu.edu 的电子邮件经过验证
Cosmin PaduraruDeepMind在 google.com 的电子邮件经过验证
Ludovic DenoyerUbisoft, ex Facebook - FAIR -- Full Professor at Sorbonne Universités on Sabatical在 ubisoft.com 的电子邮件经过验证
Patrick GallinariProfessor Sorbonne University / Criteo AI Lab在 sorbonne-universite.fr 的电子邮件经过验证
Ian OsbandOpenAI在 openai.com 的电子邮件经过验证
Mel VecerikDeepMind, University College London在 ucl.ac.uk 的电子邮件经过验证
Thomas DegrisDeepMind在 google.com 的电子邮件经过验证
Hado van HasseltResearch Scientist, DeepMind; Honorary Professor, UCL在 google.com 的电子邮件经过验证
Richard EvansResearch Fellow, University of Wolverhampton在 wlv.ac.uk 的电子邮件经过验证
Peter SunehagGoogle - DeepMind在 google.com 的电子邮件经过验证
Ben CoppinGoogle DeepMind在 deepmind.com 的电子邮件经过验证
Nir LevineResearch Engineer at DeepMind在 google.com 的电子邮件经过验证
Sven GowalDeepMind在 deepmind.com 的电子邮件经过验证
Marc LanctotResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Philippe PreuxProfessor of computer science, Université de Lille, LIFL, SequeL, INRIA在 univ-lille.fr 的电子邮件经过验证
Jean-Philippe VertOwkin / PSL University在 owkin.com 的电子邮件经过验证
Richard EvansResearch Scientist, DeepMind在 google.com 的电子邮件经过验证

关注

Gabriel Dulac-Arnold

Google DeepMind

在 squirrelsoup.net 的电子邮件经过验证 - 首页

Reinforcement Learning Planning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Learning from demonstrations for real world reinforcement learning T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, A Sendonaris, ... arXiv preprint arXiv:1704.03732, 2017	1424*	2017
Deep reinforcement learning in large discrete action spaces G Dulac-Arnold, R Evans, H van Hasselt, P Sunehag, T Lillicrap, J Hunt, ... arXiv preprint arXiv:1512.07679, 2015	729	2015
Challenges of real-world reinforcement learning G Dulac-Arnold, D Mankowitz, T Hester ICML Workshop on Real-Life RL, 2019	665	2019
Challenges of real-world reinforcement learning: definitions, benchmarks and analysis G Dulac-Arnold, N Levine, DJ Mankowitz, J Li, C Paduraru, S Gowal, ... Machine Learning 110 (9), 2419-2468, 2021	571*	2021
The predictron: End-to-end learning and planning D Silver, H Hasselt, M Hessel, T Schaul, A Guez, T Harley, ... International Conference on Machine Learning, 3191-3199, 2017	306	2017
Acme: A research framework for distributed reinforcement learning MW Hoffman, B Shahriari, J Aslanides, G Barth-Maron, N Momchev, ... arXiv preprint arXiv:2006.00979, 2020	249	2020
Rl unplugged: A suite of benchmarks for offline reinforcement learning C Gulcehre, Z Wang, A Novikov, T Paine, S Gómez, K Zolna, R Agarwal, ... Advances in Neural Information Processing Systems 33, 7248-7259, 2020	197*	2020
Model-based offline planning A Argenson, G Dulac-Arnold ICLR 2021, 2020	144	2020
Learning to run a power network challenge: a retrospective analysis A Marot, B Donnot, G Dulac-Arnold, A Kelly, A O’Sullivan, J Viebahn, ... NeurIPS 2020 Competition and Demonstration Track, 112-132, 2021	71	2021
Datum-wise classification: a sequential approach to sparsity G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Machine Learning and Knowledge Discovery in Databases: European Conference …, 2011	69	2011
Deep reinforcement learning with attention for slate markov decision processes with high-dimensional states and actions P Sunehag, R Evans, G Dulac-Arnold, Y Zwols, D Visentin, B Coppin arXiv preprint arXiv:1512.01124, 2015	58	2015
AI-based mobile application to fight antibiotic resistance M Pascucci, G Royer, J Adamek, MA Asmar, D Aristizabal, L Blanche, ... Nature communications 12 (1), 1173, 2021	57	2021
Differentiable deep clustering with cluster size constraints A Genevay, G Dulac-Arnold, JP Vert arXiv preprint arXiv:1910.09036, 2019	45	2019
Text classification: A sequential reading approach G Dulac-Arnold, L Denoyer, P Gallinari European Conference on Information Retrieval, 411-423, 2011	40	2011
Deep multi-class learning from label proportions G Dulac-Arnold, N Zeghidour, M Cuturi, L Beyer, JP Vert arXiv preprint arXiv:1905.12909, 2019	38	2019
Barkour: Benchmarking animal-level agility with quadruped robots K Caluwaerts, A Iscen, JC Kew, W Yu, T Zhang, D Freeman, KH Lee, ... arXiv preprint arXiv:2305.14654, 2023	28	2023
Fast reinforcement learning with large action sets using error-correcting output codes for mdp factorization G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Joint European Conference on Machine Learning and Knowledge Discovery in …, 2012	28	2012
Learning dynamics models for model predictive agents M Lutter, L Hasenclever, A Byravan, G Dulac-Arnold, P Trochim, N Heess, ... arXiv preprint arXiv:2109.14311, 2021	24	2021
Sequential approaches for learning datum-wise sparse representations G Dulac-Arnold, L Denoyer, P Preux, P Gallinari Machine learning 89, 87-122, 2012	24	2012
Residual reinforcement learning from demonstrations M Alakuijala, G Dulac-Arnold, J Mairal, J Ponce, C Schmid arXiv preprint arXiv:2106.08050, 2021	22	2021

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用