查看文章

arxiv.org 中的 [HTML]

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

作者

Lanqing Li*, Hai Zhang*, Xinyu Zhang, Shatong Zhu, Junqiao Zhao, Pheng-Ann Heng

发表日期

2024/2/4

期刊

arXiv preprint arXiv:2402.02429

简介

As a marriage between offline RL and meta-RL, the advent of offline meta-reinforcement learning (OMRL) has shown great promise in enabling RL agents to multi-task and quickly adapt while acquiring knowledge safely. Among which, Context-based OMRL (COMRL) as a popular paradigm, aims to learn a universal policy conditioned on effective task representations. In this work, by examining several key milestones in the field of COMRL, we propose to integrate these seemingly independent methodologies into a unified information theoretic framework. Most importantly, we show that the pre-existing COMRL algorithms are essentially optimizing the same mutual information objective between the task variable and its latent representation by implementing various approximate bounds. Based on the theoretical insight and the information bottleneck principle, we arrive at a novel algorithm dubbed UNICORN, which exhibits remarkable generalization across a broad spectrum of RL benchmarks, context shift scenarios, data qualities and deep learning architectures, attaining the new state-of-the-art. We believe that our framework could open up avenues for new optimality bounds and COMRL algorithms.

引用总数

被引用次数：1

20241

学术搜索中的文章

Towards an Information Theoretic Framework of Context-Based Offline Meta-Reinforcement Learning

L Li, H Zhang, X Zhang, S Zhu, J Zhao, PA Heng - arXiv preprint arXiv:2402.02429, 2024

被引用次数：1 相关文章所有 2 个版本