Risto Vuorio
Verified email at cs.ox.ac.uk
Title
Cited by
Year
Multimodal model-agnostic meta-learning via task-aware modulation
R Vuorio, SH Sun, H Hu, JJ Lim
Advances in neural information processing systems 32, 2019
261 citations, 2019
Deep reinforcement learning for multi-driver vehicle dispatching and repositioning problem
J Holler, R Vuorio, Z Qin, X Tang, Y Jiao, T Jin, S Singh, C Wang, J Ye
2019 IEEE International Conference on Data Mining (ICDM), 1090-1095, 2019
118 citations, 2019
A survey of meta-reinforcement learning
J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf, C Finn, S Whiteson
arXiv preprint arXiv:2301.08028, 2023
98 citations, 2023
Meta continual learning
R Vuorio, DY Cho, D Kim, J Kim
arXiv preprint arXiv:1806.06928, 2018
33 citations, 2018
Toward multimodal model-agnostic meta-learning
R Vuorio, SH Sun, H Hu, JJ Lim
arXiv preprint arXiv:1812.07172, 2018
30 citations, 2018
Hypernetworks in meta-reinforcement learning
J Beck, MT Jackson, R Vuorio, S Whiteson
Conference on Robot Learning, 1478-1487, 2023
24 citations, 2023
On the practical consistency of meta-reinforcement learning algorithms
Z Xiong, L Zintgraf, J Beck, R Vuorio, S Whiteson
arXiv preprint arXiv:2112.00478, 2021
9 citations, 2021
Adaptive pairwise weights for temporal credit assignment
Z Zheng, R Vuorio, R Lewis, S Singh
Proceedings of the AAAI Conference on Artificial Intelligence 36 (8), 9225-9232, 2022
7* citations, 2022
Learning state representations from random deep action-conditional predictions
Z Zheng, V Veeriah, R Vuorio, RL Lewis, S Singh
Advances in Neural Information Processing Systems 34, 23679-23691, 2021
6 citations, 2021
Deconfounded imitation learning
R Vuorio, J Brehmer, H Ackermann, D Dijkman, T Cohen, P de Haan
arXiv preprint arXiv:2211.02667, 2022
5 citations, 2022
No DICE: An investigation of the bias-variance tradeoff in meta-gradients
R Vuorio, JA Beck, G Farquhar, JN Foerster, S Whiteson
Deep RL Workshop NeurIPS 2021, 2021
5 citations, 2021
Discovering general reinforcement learning algorithms with adversarial environment design
MT Jackson, M Jiang, J Parker-Holder, R Vuorio, C Lu, G Farquhar, ...
Advances in Neural Information Processing Systems 36, 2024
4 citations, 2024
Recurrent hypernetworks are surprisingly strong in meta-RL
J Beck, R Vuorio, Z Xiong, S Whiteson
Advances in Neural Information Processing Systems 36, 2024
3 citations, 2024
System and process for deconfounded imitation learning
R Vuorio, DE Pim, JH Brehmer, H Ackermann, TS Cohen, DHF Dijkman
US Patent App. 18/459,258, 2024
2024
SplAgger: Split Aggregation for Meta-Reinforcement Learning
J Beck, M Jackson, R Vuorio, Z Xiong, S Whiteson
arXiv preprint arXiv:2403.03020, 2024
2024
Distilling Morphology-Conditioned Hypernetworks for Efficient Universal Morphology Control
Z Xiong, R Vuorio, J Beck, M Zimmer, K Shao, S Whiteson
arXiv preprint arXiv:2402.06570, 2024
2024
An Investigation of the Bias-Variance Tradeoff in Meta-Gradients
R Vuorio, J Beck, S Whiteson, J Foerster, G Farquhar
arXiv preprint arXiv:2209.11303, 2022
2022
Supplementary Material of Multimodal Model-Agnostic Meta-Learning via Task-Aware Modulation
R Vuorio, SH Sun, H Hu, JJ Lim
Hypernetworks in Meta-Reinforcement Learning Supplementary Materials
J Beck, M Jackson, R Vuorio, S Whiteson
Model-Agnostic Meta-Learning for Multimodal Task Distributions
R Vuorio, SH Sun, H Hu, JJ Lim