Joel Z Leibo 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	13293	11346
h 指数	41	36
i10 指数	65	55

2800

1400

700

2100

20132014201520162017201820192020202120222023202468 84 91 153 436 901 1282 1754 2134 2283 2727 1149

开放获取的出版物数量

查看全部

10 篇文章

1 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Thore GraepelGlobal Lead Computational Science, AI & ML at Altos Labs and Chair of Machine Learning, UCL在 ucl.ac.uk 的电子邮件经过验证
TOMASO POGGIOMcDermott Professor in Brain Sciences, MIT在 ai.mit.edu 的电子邮件经过验证
Edward HughesStaff Research Engineer, DeepMind在 google.com 的电子邮件经过验证
Marc LanctotResearch Scientist, Google DeepMind在 google.com 的电子邮件经过验证
Edgar A. Duéñez-GuzmánGoogle DeepMind在 oeb.harvard.edu 的电子邮件经过验证
Karl TuylsCo-Founder at H (chief Research & Operations), ex-Google DeepMind, Prof at University of Liverpool在 hcompany.ai 的电子邮件经过验证
Wojciech Marian Czarnecki.在 google.com 的电子邮件经过验证
Matthew BotvinickGoogle DeepMind, Yale Law School, University College London在 google.com 的电子邮件经过验证
Charlie BeattieSoftware Engineer, DeepMind在 google.com 的电子邮件经过验证
Peter SunehagGoogle - DeepMind在 google.com 的电子邮件经过验证
Tom SchaulSenior Staff Scientist, DeepMind在 nyu.edu 的电子邮件经过验证
Kevin R. McKeeStaff Research Scientist, Google DeepMind在 deepmind.com 的电子邮件经过验证
Audrūnas Gruslys在 gruslys.com 的电子邮件经过验证
Raphael KösterGoogle DeepMind在 google.com 的电子邮件经过验证
Jane X. WangStaff Research Scientist, DeepMind在 google.com 的电子邮件经过验证
Max JaderbergChief AI Scientist, Isomorphic Labs在 robots.ox.ac.uk 的电子邮件经过验证
Fabio AnselmiAssistant professor at University of Trieste, MIT affiliate在 units.it 的电子邮件经过验证
Vinicius ZambaldiGoogle Deepmind在 google.com 的电子邮件经过验证
Dharshan KumaranGoogle DeepMind在 fil.ion.ucl.ac.uk 的电子邮件经过验证
Zeb Kurth-NelsonDeepMind, UCL在 google.com 的电子邮件经过验证

关注

Joel Z Leibo

Research scientist

在 google.com 的电子邮件经过验证 - 首页

Cooperation in AI & Neuroscience Multi-Agent Reinforcement Learning Machine Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Value-decomposition networks for cooperative multi-agent learning P Sunehag, G Lever, A Gruslys, WM Czarnecki, V Zambaldi, M Jaderberg, ... arXiv preprint arXiv:1706.05296, 2017	1655	2017
Reinforcement learning with unsupervised auxiliary tasks M Jaderberg, V Mnih, WM Czarnecki, T Schaul, JZ Leibo, D Silver, ... arXiv preprint arXiv:1611.05397, 2016	1378	2016
Deep q-learning from demonstrations T Hester, M Vecerik, O Pietquin, M Lanctot, T Schaul, B Piot, D Horgan, ... Proceedings of the AAAI conference on artificial intelligence 32 (1), 2018	1362*	2018
Learning to reinforcement learn JX Wang, Z Kurth-Nelson, D Tirumala, H Soyer, JZ Leibo, R Munos, ... arXiv preprint arXiv:1611.05763, 2016	1048	2016
Human-level performance in 3D multiplayer games with population-based reinforcement learning M Jaderberg, WM Czarnecki, I Dunning, L Marris, G Lever, AG Castaneda, ... Science 364 (6443), 859-865, 2019	937	2019
Multi-agent reinforcement learning in sequential social dilemmas JZ Leibo, V Zambaldi, M Lanctot, J Marecki, T Graepel arXiv preprint arXiv:1702.03037, 2017	909	2017
Prefrontal cortex as a meta-reinforcement learning system JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala, H Soyer, JZ Leibo, ... Nature neuroscience 21 (6), 860-868, 2018	625	2018
Deepmind lab C Beattie, JZ Leibo, D Teplyashin, T Ward, M Wainwright, H Küttler, ... arXiv preprint arXiv:1612.03801, 2016	595	2016
Social influence as intrinsic motivation for multi-agent deep reinforcement learning N Jaques, A Lazaridou, E Hughes, C Gulcehre, P Ortega, DJ Strouse, ... International conference on machine learning, 3040-3049, 2019	522*	2019
Model-free episodic control C Blundell, B Uria, A Pritzel, Y Li, A Ruderman, JZ Leibo, J Rae, ... arXiv preprint arXiv:1606.04460, 2016	293	2016
The dynamics of invariant object recognition in the human visual system L Isik, EM Meyers, JZ Leibo, T Poggio Journal of neurophysiology 111 (1), 91-102, 2014	277	2014
Using fast weights to attend to the recent past J Ba, GE Hinton, V Mnih, JZ Leibo, C Ionescu Advances in neural information processing systems 29, 2016	266	2016
Inequity aversion improves cooperation in intertemporal social dilemmas E Hughes, JZ Leibo, M Phillips, K Tuyls, E Dueñez-Guzman, ... Advances in neural information processing systems 31, 2018	246	2018
A multi-agent reinforcement learning model of common-pool resource appropriation J Perolat, JZ Leibo, V Zambaldi, C Beattie, K Tuyls, T Graepel Advances in neural information processing systems 30, 2017	213	2017
Unsupervised predictive memory in a goal-directed agent G Wayne, CC Hung, D Amos, M Mirza, A Ahuja, A Grabska-Barwinska, ... arXiv preprint arXiv:1803.10760, 2018	195	2018
Open problems in cooperative ai A Dafoe, E Hughes, Y Bachrach, T Collins, KR McKee, JZ Leibo, K Larson, ... arXiv preprint arXiv:2012.08630, 2020	192	2020
Emergent communication through negotiation K Cao, A Lazaridou, M Lanctot, JZ Leibo, K Tuyls, S Clark arXiv preprint arXiv:1804.03980, 2018	184	2018
How important is weight symmetry in backpropagation? Q Liao, J Leibo, T Poggio Proceedings of the AAAI Conference on Artificial Intelligence 30 (1), 2016	172	2016
Unsupervised learning of invariant representations F Anselmi, JZ Leibo, L Rosasco, J Mutch, A Tacchetti, T Poggio Theoretical Computer Science 633, 112-121, 2016	143	2016
Kickstarting deep reinforcement learning S Schmitt, JJ Hudson, A Zidek, S Osindero, C Doersch, WM Czarnecki, ... arXiv preprint arXiv:1803.03835, 2018	140	2018

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用