Jacob Beck
University of Oxford
Verified email at alumni.brown.edu
Title
Cited by
Year
A survey of meta-reinforcement learning
J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson
arXiv preprint arXiv:2301.08028, 2023
Cited by: 98; Year: 2023
Hypernetworks in Meta-Reinforcement Learning
J Beck, MT Jackson, R Vuorio, S Whiteson
6th Annual Conference on Robot Learning, 2022
Cited by: 24; Year: 2022
AMRL: Aggregated memory for reinforcement learning
J Beck, K Ciosek, S Devlin, S Tschiatschek, C Zhang, K Hofmann
International Conference on Learning Representations, 2020
Cited by: 20; Year: 2020
Monotonic Improvement Guarantees under Non-stationarity for Decentralized PPO
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
arXiv preprint arXiv:2202.00082, 2022
Cited by: 12; Year: 2022
On the practical consistency of meta-reinforcement learning algorithms
Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson
arXiv preprint arXiv:2112.00478, 2021
Cited by: 9; Year: 2021
Stackelberg punishment and bully-proofing autonomous vehicles
M Cooper, JK Lee, J Beck, JD Fishman, M Gillett, Z Papakipos, A Zhang, ...
Social Robotics: 11th International Conference, ICSR 2019, Madrid, Spain …, 2019
Cited by: 9; Year: 2019
Trust region bounds for decentralized PPO under non-stationarity
M Sun, S Devlin, J Beck, K Hofmann, S Whiteson
arXiv preprint arXiv:2202.00082, 2022
Cited by: 8; Year: 2022
No DICE: An investigation of the bias-variance tradeoff in meta-gradients
R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson
Deep RL Workshop NeurIPS 2021, 2021
Cited by: 5; Year: 2021
Recurrent hypernetworks are surprisingly strong in meta-RL
J Beck, R Vuorio, Z Xiong, S Whiteson
Advances in Neural Information Processing Systems 36, 2024
Cited by: 3; Year: 2024
Universal morphology control via contextual modulation
Z Xiong, J Beck, S Whiteson
International Conference on Machine Learning, 38286-38300, 2023
Cited by: 3; Year: 2023
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback
J Beck, Z Papakipos, M Littman
arXiv preprint arXiv:1901.05101, 2019
Cited by: 2; Year: 2019
SplAgger: Split Aggregation for Meta-Reinforcement Learning
J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson
arXiv preprint arXiv:2403.03020, 2024
2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson
arXiv preprint arXiv:2402.06570, 2024
2024
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar
arXiv preprint arXiv:2209.11303, 2022
2022
Human-Actor Human-Critic
J Beck, N Srinivasan, A Shah, J Roy
2020
Neural Mesh: Introducing a Notion of Space and Conservation of Energy to Neural Networks
J Beck, Z Papakipos
arXiv preprint arXiv:1807.11121, 2018
2018
Informing climate risk analysis using textual information-A research agenda
A Dimmelmeier, HC Doll, M Schierholz, E Kormanyos, M Fehr, B Ma, ...
Natural Language Processing meets Climate Change @ ACL 2024
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials
J Beck, M Jackson, R Vuorio, S Whiteson
ReNeg and Backseat Driver: Learning from demonstration with continuous human feedback
Z Papakipos, J Beck, M Littman
Collaboration in Deep MARL
J Beck