Towards continual reinforcement learning: A review and perspectives

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org
In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org
The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

A survey on causal reinforcement learning

Y Zeng, R Cai, F Sun, L Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …

Temporally disentangled representation learning

W Yao, G Chen, K Zhang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
Recently in the field of unsupervised representation learning, strong identifiability results for
disentanglement of causally-related latent variables have been established by exploiting …

Factored adaptation for non-stationary reinforcement learning

F Feng, B Huang, K Zhang… - Advances in Neural …, 2022 - proceedings.neurips.cc
Dealing with non-stationarity in environments (eg, in the transition dynamics) and objectives
(eg, in the reward functions) is a challenging problem that is crucial in real-world …

Learning dynamic attribute-factored world models for efficient multi-object reinforcement learning

F Feng, S Magliacane - Advances in Neural Information …, 2024 - proceedings.neurips.cc
In many reinforcement learning tasks, the agent has to learn to interact with many objects of
different types and generalize to unseen combinations and numbers of objects. Often a task …

Temporally disentangled representation learning under unknown nonstationarity

X Song, W Yao, Y Fan, X Dong… - Advances in …, 2024 - proceedings.neurips.cc
In unsupervised causal representation learning for sequential data with time-delayed latent
causal influences, strong identifiability results for the disentanglement of causally-related …

Context shift reduction for offline meta-reinforcement learning

Y Gao, R Zhang, J Guo, F Wu, Q Yi… - Advances in …, 2024 - proceedings.neurips.cc
Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to
enhance the agent's generalization ability on unseen tasks. However, the context shift …

Causal reinforcement learning: A survey

Z Deng, J Jiang, G Long, C Zhang - arXiv preprint arXiv:2307.01452, 2023 - arxiv.org
Reinforcement learning is an essential paradigm for solving sequential decision problems
under uncertainty. Despite many remarkable achievements in recent decades, applying …

Efficient symbolic policy learning with differentiable symbolic expression

J Guo, R Zhang, S Peng, Q Yi, X Hu… - Advances in …, 2024 - proceedings.neurips.cc
Deep reinforcement learning (DRL) has led to a wide range of advances in sequential
decision-making tasks. However, the complexity of neural network policies makes it difficult …