Visual reinforcement learning (RL), which makes decisions directly from high-dimensional visual inputs, has demonstrated significant potential in various domains. However …
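The setting described in the snippet above, a policy that maps raw high-dimensional pixels directly to actions, can be sketched minimally as follows. The observation shape, action count, and the random linear "encoder plus policy head" are illustrative assumptions, not any specific published architecture.

```python
import numpy as np

rng = np.random.default_rng(0)

H, W, C = 84, 84, 3   # a typical pixel-observation size in visual RL (assumed)
N_ACTIONS = 4         # hypothetical discrete action space

# Stand-in for a learned encoder + policy head: one linear map from
# flattened pixels to action logits.
policy_weights = rng.normal(scale=0.01, size=(H * W * C, N_ACTIONS))

def act(observation: np.ndarray) -> int:
    """Choose an action directly from pixels (greedy w.r.t. the logits)."""
    features = observation.reshape(-1)   # no hand-crafted state: raw pixels in
    logits = features @ policy_weights
    return int(np.argmax(logits))

obs = rng.uniform(0.0, 1.0, size=(H, W, C))  # a fake camera frame
action = act(obs)
print(action)  # an integer in [0, N_ACTIONS)
```

The point of the sketch is only the interface: decisions are computed from the image itself, with no access to underlying simulator state.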

Reinforcement learning methods trained on few environments rarely learn policies that generalize to unseen environments. To improve generalization, we incorporate the inherent …

N Hansen, H Su, X Wang - Advances in neural information …, 2021 - proceedings.neurips.cc
While agents trained by Reinforcement Learning (RL) can solve increasingly challenging tasks directly from visual observations, generalizing learned skills to novel environments …

D Ghosh, J Rahme, A Kumar, A Zhang… - Advances in neural …, 2021 - proceedings.neurips.cc
Generalization is a central challenge for the deployment of reinforcement learning (RL) systems in the real world. In this paper, we show that the sequential structure of the RL …

R Shah, V Kumar - arXiv preprint arXiv:2107.03380, 2021 - arxiv.org
The ability to autonomously learn behaviors via direct interactions in uninstrumented environments can lead to generalist robots capable of enhancing productivity or providing …

F Deng, I Jang, S Ahn - International conference on machine …, 2022 - proceedings.mlr.press
Reconstruction-based Model-Based Reinforcement Learning (MBRL) agents, such as Dreamer, often fail to discard task-irrelevant visual distractions that are prevalent in …

Z Yuan, S Yang, P Hua, C Chang… - Advances in Neural …, 2024 - proceedings.neurips.cc
Visual Reinforcement Learning (Visual RL), coupled with high-dimensional observations, has consistently confronted the long-standing challenge of out-of-distribution …

X Fu, G Yang, P Agrawal… - … Conference on Machine …, 2021 - proceedings.mlr.press
Current model-based reinforcement learning methods struggle when operating from complex visual scenes due to their inability to prioritize task-relevant features. To mitigate …

Visual model-based RL methods typically encode image observations into low-dimensional representations in a manner that does not eliminate redundant information. This leaves them …
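Several snippets above make the same point: a pixel-reconstruction objective weights every pixel equally, so task-irrelevant regions that fill most of the frame dominate the loss and the learned representation retains redundant information. A toy numeric sketch (not any specific agent's implementation; the frame size, object mask, and noise scale are assumptions) makes this concrete:

```python
import numpy as np

rng = np.random.default_rng(1)

frame = rng.uniform(size=(64, 64))          # fake 64x64 grayscale observation
task_mask = np.zeros((64, 64), dtype=bool)
task_mask[28:36, 28:36] = True              # assume an 8x8 task-relevant object

# An imperfect reconstruction: uniform per-pixel error everywhere.
recon = frame + rng.normal(scale=0.1, size=frame.shape)
sq_err = (recon - frame) ** 2

task_loss = sq_err[task_mask].sum()         # 64 task-relevant pixels
distractor_loss = sq_err[~task_mask].sum()  # 4032 background pixels
print(distractor_loss > task_loss)          # prints True: background dominates
```

Since the squared error is roughly uniform per pixel, the total is proportional to pixel count, so the background contributes about 60x more loss than the object. A model optimized this way has every incentive to reconstruct distractors faithfully, which is the failure mode the Deng et al. and Fu et al. snippets describe.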