An algebraic approach to abstraction in reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com

Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

被引用次数：1858 相关文章所有 16 个版本

[PDF] mlr.press

Information-theoretic considerations in batch reinforcement learning

J Chen, N Jiang - International Conference on Machine …, 2019 - proceedings.mlr.press

Value-function approximation methods that operate in batch mode have foundational
importance to reinforcement learning (RL). Finite sample guarantees for these methods …

被引用次数：397 相关文章所有 8 个版本

[PDF] mlr.press

Kinematic state abstraction and provably efficient rich-observation reinforcement learning

D Misra, M Henaff, A Krishnamurthy… - … on machine learning, 2020 - proceedings.mlr.press

We present an algorithm, HOMER, for exploration and reinforcement learning in rich
observation environments that are summarizable by an unknown latent state space. The …

被引用次数：193 相关文章所有 8 个版本

[PDF] neurips.cc

For sale: State-action representation learning for deep reinforcement learning

S Fujimoto, WD Chang, E Smith… - Advances in …, 2024 - proceedings.neurips.cc

In reinforcement learning (RL), representation learning is a proven tool for complex image-
based tasks, but is often overlooked for environments with low-level states, such as physical …

被引用次数：37 相关文章所有 5 个版本

[PDF] jmlr.org

[PDF][PDF] Transfer learning for reinforcement learning domains: A survey.

ME Taylor, P Stone - Journal of Machine Learning Research, 2009 - jmlr.org

The reinforcement learning paradigm is a popular way to address problems that have only
limited environmental feedback, rather than correctly labeled examples, as is common in …

被引用次数：2380 相关文章所有 24 个版本

[PDF] neurips.cc

Towards robust bisimulation metric learning

M Kemertas… - Advances in Neural …, 2021 - proceedings.neurips.cc

Learned representations in deep reinforcement learning (DRL) have to extract task-relevant
information from complex observations, balancing between robustness to distraction and …

被引用次数：44 相关文章所有 10 个版本

[PDF] arxiv.org

Learning symmetric embeddings for equivariant world models

JY Park, O Biza, L Zhao, JW van de Meent… - arXiv preprint arXiv …, 2022 - arxiv.org

Incorporating symmetries can lead to highly data-efficient and generalizable models by
defining equivalence classes of data samples related by transformations. However …

被引用次数：45 相关文章所有 4 个版本

[PDF] arxiv.org

Off-dynamics reinforcement learning: Training for transfer with domain classifiers

B Eysenbach, S Asawa, S Chaudhari, S Levine… - arXiv preprint arXiv …, 2020 - arxiv.org

We propose a simple, practical, and intuitive approach for domain adaptation in
reinforcement learning. Our approach stems from the idea that the agent's experience in the …

被引用次数：84 相关文章所有 5 个版本

[PDF] arxiv.org

Causal dynamics learning for task-independent state abstraction

Z Wang, X Xiao, Z Xu, Y Zhu, P Stone - arXiv preprint arXiv:2206.13452, 2022 - arxiv.org

Learning dynamics models accurately is an important goal for Model-Based Reinforcement
Learning (MBRL), but most MBRL methods learn a dense dynamics model which is …

被引用次数：35 相关文章所有 11 个版本

[PDF] neurips.cc

Continuous mdp homomorphisms and homomorphic policy gradient

S Rezaei-Shoshtari, R Zhao… - Advances in …, 2022 - proceedings.neurips.cc

Abstraction has been widely studied as a way to improve the efficiency and generalization of
reinforcement learning algorithms. In this paper, we study abstraction in the continuous …

被引用次数：20 相关文章所有 6 个版本

高级搜索

QQ 群