Jacob Beck 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	193	193
h 指数	7	7
i10 指数	4	4

202020212022202320242 8 22 89 72

开放获取的出版物数量

查看全部

4 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Shimon WhitesonProfessor of Computer Science, University of Oxford / Senior Staff Research Scientist, Waymo在 cs.ox.ac.uk 的电子邮件经过验证
Risto VuorioUniversity of Oxford在 cs.ox.ac.uk 的电子邮件经过验证
Luisa ZintgrafDeepMind在 deepmind.com 的电子邮件经过验证
Sam DevlinMicrosoft Research Cambridge在 microsoft.com 的电子邮件经过验证
Katja HofmannMicrosoft Research在 microsoft.com 的电子邮件经过验证
Zoë PapakiposMeta AI在 fb.com 的电子邮件经过验证
Michael LittmanBrown University在 brown.edu 的电子邮件经过验证

关注

Jacob Beck

University of Oxford

在 alumni.brown.edu 的电子邮件经过验证 - 首页

Reinforcement Learning Sequence Models Meta-RL Multi-Agent In-Context Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A survey of meta-reinforcement learning J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson arXiv preprint arXiv:2301.08028, 2023	98	2023
Hypernetworks in Meta-Reinforcement Learning J Beck, MT Jackson, R Vuorio, S Whiteson 6th Annual Conference on Robot Learning, 2022	24	2022
Amrl: Aggregated memory for reinforcement learning J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann International Conference on Learning Representations, 2020	20	2020
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO M Sun, S Devlin, J Beck, K Hofmann, S Whiteson arXiv preprint arXiv:2202.00082, 2022	12	2022
On the practical consistency of meta-reinforcement learning algorithms Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson arXiv preprint arXiv:2112.00478, 2021	9	2021
Stackelberg punishment and bully-proofing autonomous vehicles M Cooper, JK Lee, J Beck, JD Fishman, M Gillett, Z Papakipos, A Zhang, ... Social Robotics: 11th International Conference, ICSR 2019, Madrid, Spain …, 2019	9	2019
Trust region bounds for decentralized ppo under non-stationarity M Sun, S Devlin, J Beck, K Hofmann, S Whiteson arXiv preprint arXiv:2202.00082, 2022	8	2022
No DICE: An investigation of the bias-variance tradeoff in meta-gradients R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson Deep RL Workshop NeurIPS 2021, 2021	5	2021
Recurrent hypernetworks are surprisingly strong in meta-RL J Beck, R Vuorio, Z Xiong, S Whiteson Advances in Neural Information Processing Systems 36, 2024	3	2024
Universal morphology control via contextual modulation Z Xiong, J Beck, S Whiteson International Conference on Machine Learning, 38286-38300, 2023	3	2023
Reneg and backseat driver: Learning from demonstration with continuous human feedback J Beck, Z Papakipos, M Littman arXiv preprint arXiv:1901.05101, 2019	2	2019
SplAgger: Split Aggregation for Meta-Reinforcement Learning J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson arXiv preprint arXiv:2403.03020, 2024		2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson arXiv preprint arXiv:2402.06570, 2024		2024
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar arXiv preprint arXiv:2209.11303, 2022		2022
Human-Actor Human-Critic J Beck, N Srinivasan, A Shah, J Roy		2020
Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks J Beck, Z Papakipos arXiv preprint arXiv:1807.11121, 2018		2018
Informing climate risk analysis using textual information-A research agenda A Dimmelmeier, HC Doll, M Schierholz, E Kormanyos, M Fehr, B Ma, ... Natural Language Processing meets Climate Change@ ACL 2024, 0
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials J Beck, M Jackson, R Vuorio, S Whiteson
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback Z Papakipos, J Beck, M Littman
Collaboration in Deep MARL J Beck

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用