Adarl: What, where, and how to adapt in transfer reinforcement learning

K Khetarpal, M Riemer, I Rish, D Precup - Journal of Artificial Intelligence …, 2022 - jair.org

In this article, we aim to provide a literature review of different formulations and approaches
to continual reinforcement learning (RL), also known as lifelong or non-stationary RL. We …

被引用次数：333 相关文章所有 9 个版本

[PDF] jair.org Full View

A survey of zero-shot generalisation in deep reinforcement learning

R Kirk, A Zhang, E Grefenstette, T Rocktäschel - Journal of Artificial …, 2023 - jair.org

The study of zero-shot generalisation (ZSG) in deep Reinforcement Learning (RL) aims to
produce RL algorithms whose policies generalise well to novel unseen situations at …

被引用次数：408 相关文章所有 9 个版本

[PDF] arxiv.org

A survey on causal reinforcement learning

Y Zeng, R Cai, F Sun, L Huang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

While reinforcement learning (RL) achieves tremendous success in sequential decision-
making problems of many domains, it still faces key challenges of data inefficiency and the …

被引用次数：23 相关文章所有 2 个版本

[PDF] neurips.cc

Temporally disentangled representation learning

W Yao, G Chen, K Zhang - Advances in Neural Information …, 2022 - proceedings.neurips.cc

Recently in the field of unsupervised representation learning, strong identifiability results for
disentanglement of causally-related latent variables have been established by exploiting …

被引用次数：53 相关文章所有 7 个版本

[PDF] neurips.cc

Factored adaptation for non-stationary reinforcement learning

F Feng, B Huang, K Zhang… - Advances in Neural …, 2022 - proceedings.neurips.cc

Dealing with non-stationarity in environments (eg, in the transition dynamics) and objectives
(eg, in the reward functions) is a challenging problem that is crucial in real-world …

被引用次数：38 相关文章所有 9 个版本

[PDF] neurips.cc

Learning dynamic attribute-factored world models for efficient multi-object reinforcement learning

F Feng, S Magliacane - Advances in Neural Information …, 2024 - proceedings.neurips.cc

In many reinforcement learning tasks, the agent has to learn to interact with many objects of
different types and generalize to unseen combinations and numbers of objects. Often a task …

被引用次数：9 相关文章所有 7 个版本

[PDF] neurips.cc

Temporally disentangled representation learning under unknown nonstationarity

X Song, W Yao, Y Fan, X Dong… - Advances in …, 2024 - proceedings.neurips.cc

In unsupervised causal representation learning for sequential data with time-delayed latent
causal influences, strong identifiability results for the disentanglement of causally-related …

被引用次数：9 相关文章所有 5 个版本

[PDF] neurips.cc

Context shift reduction for offline meta-reinforcement learning

Y Gao, R Zhang, J Guo, F Wu, Q Yi… - Advances in …, 2024 - proceedings.neurips.cc

Offline meta-reinforcement learning (OMRL) utilizes pre-collected offline datasets to
enhance the agent's generalization ability on unseen tasks. However, the context shift …

被引用次数：11 相关文章所有 6 个版本

[PDF] arxiv.org

Causal reinforcement learning: A survey

Z Deng, J Jiang, G Long, C Zhang - arXiv preprint arXiv:2307.01452, 2023 - arxiv.org

Reinforcement learning is an essential paradigm for solving sequential decision problems
under uncertainty. Despite many remarkable achievements in recent decades, applying …

被引用次数：15 相关文章所有 5 个版本

[PDF] neurips.cc

Efficient symbolic policy learning with differentiable symbolic expression

J Guo, R Zhang, S Peng, Q Yi, X Hu… - Advances in …, 2024 - proceedings.neurips.cc

Deep reinforcement learning (DRL) has led to a wide range of advances in sequential
decision-making tasks. However, the complexity of neural network policies makes it difficult …

被引用次数：3 相关文章所有 5 个版本

高级搜索

QQ 群