Meta-reinforcement learning of structured exploration strategies

A Gupta, R Mendonca, YX Liu… - Advances in neural …, 2018 - proceedings.neurips.cc
… arises with meta-learning exploration is: if our goal is to learn … diverse and challenging tasks
at meta-training time to acquire … reward tasks to meta-learn exploration strategies that work …

Learning to adapt in dynamic, real-world environments through meta-reinforcement learning

A Nagabandi, I Clavera, S Liu, RS Fearing… - arXiv preprint arXiv …, 2018 - arxiv.org
meta reinforcement learning approach that achieves online adaptation in dynamic environments.
To the best knowledge of the authors, this is the first meta-reinforcement learningmeta

Meta-world: A benchmark and evaluation for multi-task and meta reinforcement learning

T Yu, D Quillen, Z He, R Julian… - … on robot learning, 2020 - proceedings.mlr.press
… In order to study the capabilities of current multi-task and meta-reinforcement learning … We
contend that multi-task and meta reinforcement learning methods that aim to efficiently learn

Meta reinforcement learning as task inference

J Humplik, A Galashov, L Hasenclever… - arXiv preprint arXiv …, 2019 - arxiv.org
meta-training is a simple and cheap way to boost the performance of meta-RL agents; 2) We
show that we can train such meta-… that our agents can solve difficult meta-RL problems in …

A survey of meta-reinforcement learning

J Beck, R Vuorio, EZ Liu, Z Xiong, L Zintgraf… - arXiv preprint arXiv …, 2023 - arxiv.org
meta-RL: few-shot meta-RL. Here, the goal is to learn an RL algorithm capable of fast adaptation,
ie, learning … on a given task distribution, and meta-learn how to efficiently adapt to any …

Prefrontal cortex as a meta-reinforcement learning system

JX Wang, Z Kurth-Nelson, D Kumaran, D Tirumala… - Nature …, 2018 - nature.com
… Although we cannot rule out the possibility that forms of meta-learning may also emerge in
these loops, we observe that the meta-RL effect described in this paper emerges only when …

On the effectiveness of fine-tuning versus meta-reinforcement learning

M Zhao, P Abbeel, S James - Advances in Neural …, 2022 - proceedings.neurips.cc
… tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have
emerged as a popular solution to achieve this. However, metareinforcement learning (meta-RL) …

Meta-gradient reinforcement learning

Z Xu, HP van Hasselt, D Silver - Advances in neural …, 2018 - proceedings.neurips.cc
… of learning. We derive a practical gradient-based meta-learning algorithm and show that this
… significantly improve performance on large-scale deep reinforcement learning applications. …

Meta-learning in reinforcement learning

N Schweighofer, K Doya - Neural Networks, 2003 - Elsevier
Meta-parameters in reinforcement learning should be tuned … meta-reinforcement learning
algorithm for tuning these meta-… appropriate meta-parameter values, and controls the meta-…

Offline meta-reinforcement learning with advantage weighting

E Mitchell, R Rafailov, XB Peng… - … Machine Learning, 2021 - proceedings.mlr.press
… This paper introduces the offline metareinforcement learning (offline meta-RL) problem …
Offline meta-RL is analogous to the widely successful supervised learning strategy of pre-training …