meta reinforcement learning- 学术资源搜索

Meta-reinforcement learning of structured exploration strategies

A Gupta, R Mendonca, YX Liu… - Advances in neural …, 2018 - proceedings.neurips.cc

… arises with meta-learning exploration is: if our goal is to learn … diverse and challenging tasks
at meta-training time to acquire … reward tasks to meta-learn exploration strategies that work …

被引用次数：411 相关文章所有 8 个版本

[PDF] arxiv.org

Learning to adapt in dynamic, real-world environments through meta-reinforcement learning

A Nagabandi, I Clavera, S Liu, RS Fearing… - arXiv preprint arXiv …, 2018 - arxiv.org

… meta reinforcement learning approach that achieves online adaptation in dynamic environments.
To the best knowledge of the authors, this is the first meta-reinforcement learning … meta …

被引用次数：592 相关文章所有 7 个版本

[PDF] mlr.press

Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

T Yu, D Quillen, Z He, R Julian… - … on robot learning, 2020 - proceedings.mlr.press

… In order to study the capabilities of current multi-task and meta-reinforcement learning … We
contend that multi-task and meta reinforcement learning methods that aim to efficiently learn …

被引用次数：957 相关文章所有 8 个版本

[PDF] arxiv.org

Meta reinforcement learning as task inference

J Humplik, A Galashov, L Hasenclever… - arXiv preprint arXiv …, 2019 - arxiv.org

… meta-training is a simple and cheap way to boost the performance of meta-RL agents; 2) We
show that we can train such meta-… that our agents can solve difficult meta-RL problems in …

被引用次数：135 相关文章所有 7 个版本

相关搜索

[PDF] arxiv.org

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org

… meta-RL: few-shot meta-RL. Here, the goal is to learn an RL algorithm capable of fast adaptation,
ie, learning … on a given task distribution, and meta-learn how to efficiently adapt to any …

被引用次数：95 相关文章所有 2 个版本

[PDF] biorxiv.org

Prefrontal cortex as a meta-reinforcement learning system

JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala… - Nature …, 2018 - nature.com

… Although we cannot rule out the possibility that forms of meta-learning may also emerge in
these loops, we observe that the meta-RL effect described in this paper emerges only when …

被引用次数：625 相关文章所有 12 个版本

[PDF] neurips.cc

On the effectiveness of fine-tuning versus meta-reinforcement learning

M Zhao, P Abbeel, S James - Advances in Neural …, 2022 - proceedings.neurips.cc

… tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have
emerged as a popular solution to achieve this. However, metareinforcement learning (meta-RL) …

被引用次数：11 相关文章所有 2 个版本

[PDF] neurips.cc

Meta-gradient reinforcement learning

Z Xu, HP van Hasselt, D Silver - Advances in neural …, 2018 - proceedings.neurips.cc

… of learning. We derive a practical gradient-based meta-learning algorithm and show that this
… significantly improve performance on large-scale deep reinforcement learning applications. …

被引用次数：348 相关文章所有 9 个版本

[PDF] usc.edu

Meta-learning in reinforcement learning

N Schweighofer, K Doya - Neural Networks, 2003 - Elsevier

… Meta-parameters in reinforcement learning should be tuned … meta-reinforcement learning
algorithm for tuning these meta-… appropriate meta-parameter values, and controls the meta-…

被引用次数：317 相关文章所有 13 个版本

[PDF] mlr.press

Offline meta-reinforcement learning with advantage weighting

E Mitchell, R Rafailov, XB Peng… - … Machine Learning, 2021 - proceedings.mlr.press

… This paper introduces the offline metareinforcement learning (offline meta-RL) problem …
Offline meta-RL is analogous to the widely successful supervised learning strategy of pre-training …

被引用次数：103 相关文章所有 8 个版本

高级搜索

QQ 群

Meta-reinforcement learning of structured exploration strategies

Learning to adapt in dynamic, real-world environments through meta-reinforcement learning

Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

Meta reinforcement learning as task inference

相关搜索

A survey of meta-reinforcement learning

Prefrontal cortex as a meta-reinforcement learning system

On the effectiveness of fine-tuning versus meta-reinforcement learning

Meta-gradient reinforcement learning

Meta-learning in reinforcement learning

Offline meta-reinforcement learning with advantage weighting

引用