Metadiffuser: Diffusion model as conditional planner for offline meta-rl

F Ni, J Hao, Y Mu, Y Yuan, Y Zheng… - International …, 2023 - proceedings.mlr.press
Recently, diffusion model shines as a promising backbone for the sequence modeling
paradigm in offline reinforcement learning (RL). However, these works mostly lack the …

Focal: Efficient fully-offline meta-reinforcement learning via distance metric learning and behavior regularization

L Li, R Yang, D Luo - arXiv preprint arXiv:2010.01112, 2020 - arxiv.org
We study the offline meta-reinforcement learning (OMRL) problem, a paradigm which
enables reinforcement learning (RL) algorithms to quickly adapt to unseen tasks without any …

Context shift reduction for offline meta-reinforcement learning

Y Gao, R Zhang, J Guo, F Wu, Q Yi… - Advances in …, 2024 - proceedings.neurips.cc
Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to
enhance the agent's generalization ability on unseen tasks. However, the context shift …

Domino: Decomposed mutual information optimization for generalized context in meta-reinforcement learning

Y Mu, Y Zhuang, F Ni, B Wang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Adapting to the changes in transition dynamics is essential in robotic applications. By
learning a conditional policy with a compact context, context-aware meta-reinforcement …

Contrabar: Contrastive bayes-adaptive deep rl

E Choshen, A Tamar - International Conference on Machine …, 2023 - proceedings.mlr.press
In meta reinforcement learning (meta RL), an agent seeks a Bayes-optimal policy–the
optimal policy when facing an unknown task that is sampled from some known task …

In-context reinforcement learning for variable action spaces

V Sinii, A Nikulin, V Kurenkov, I Zisman… - arXiv preprint arXiv …, 2023 - arxiv.org
Recent work has shown that supervised pre-training on learning histories of RL algorithms
results in a model that captures the learning process and is able to improve in-context on …

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

L Li, H Zhang, X Zhang, S Zhu, J Zhao… - arXiv preprint arXiv …, 2024 - arxiv.org
As a marriage between offline RL and meta-RL, the advent of offline meta-reinforcement
learning (OMRL) has shown great promise in enabling RL agents to multi-task and quickly …

Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making

J Ruan, K Wang, Q Zhang, D Xing, B Xu - Machine Intelligence Research, 2024 - Springer
Decomposing complex real-world tasks into simpler subtasks and devising a subtask
execution plan is critical for humans to achieve effective decision-making. However …

On first-order meta-reinforcement learning with moreau envelopes

MT Toghani, S Perez-Salazar… - 2023 62nd IEEE …, 2023 - ieeexplore.ieee.org
Meta-Reinforcement Learning (MRL) is a promising framework for training agents that can
quickly adapt to new environments and tasks. In this work, we study the MRL problem under …

Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning

X Yu, M Dunion, X Li, SV Albrecht - arXiv preprint arXiv:2406.04815, 2024 - arxiv.org
Meta-Reinforcement Learning (Meta-RL) agents can struggle to operate across tasks with
varying environmental features that require different optimal skills (ie, different modes of …