An introduction to deep reinforcement learning

V François-Lavet, P Henderson, R Islam… - … and Trends® in …, 2018 - nowpublishers.com
Deep reinforcement learning is the combination of reinforcement learning (RL) and deep
learning. This field of research has been able to solve a wide range of complex …

Information-theoretic considerations in batch reinforcement learning

J Chen, N Jiang - International Conference on Machine …, 2019 - proceedings.mlr.press
Value-function approximation methods that operate in batch mode have foundational
importance to reinforcement learning (RL). Finite sample guarantees for these methods …

Kinematic state abstraction and provably efficient rich-observation reinforcement learning

D Misra, M Henaff, A Krishnamurthy… - … on machine learning, 2020 - proceedings.mlr.press
We present an algorithm, HOMER, for exploration and reinforcement learning in rich
observation environments that are summarizable by an unknown latent state space. The …

For sale: State-action representation learning for deep reinforcement learning

S Fujimoto, WD Chang, E Smith… - Advances in …, 2024 - proceedings.neurips.cc
In reinforcement learning (RL), representation learning is a proven tool for complex image-
based tasks, but is often overlooked for environments with low-level states, such as physical …

[PDF][PDF] Transfer learning for reinforcement learning domains: A survey.

ME Taylor, P Stone - Journal of Machine Learning Research, 2009 - jmlr.org
The reinforcement learning paradigm is a popular way to address problems that have only
limited environmental feedback, rather than correctly labeled examples, as is common in …

Towards robust bisimulation metric learning

M Kemertas… - Advances in Neural …, 2021 - proceedings.neurips.cc
Learned representations in deep reinforcement learning (DRL) have to extract task-relevant
information from complex observations, balancing between robustness to distraction and …

Learning symmetric embeddings for equivariant world models

JY Park, O Biza, L Zhao, JW van de Meent… - arXiv preprint arXiv …, 2022 - arxiv.org
Incorporating symmetries can lead to highly data-efficient and generalizable models by
defining equivalence classes of data samples related by transformations. However …

Off-dynamics reinforcement learning: Training for transfer with domain classifiers

B Eysenbach, S Asawa, S Chaudhari, S Levine… - arXiv preprint arXiv …, 2020 - arxiv.org
We propose a simple, practical, and intuitive approach for domain adaptation in
reinforcement learning. Our approach stems from the idea that the agent's experience in the …

Causal dynamics learning for task-independent state abstraction

Z Wang, X Xiao, Z Xu, Y Zhu, P Stone - arXiv preprint arXiv:2206.13452, 2022 - arxiv.org
Learning dynamics models accurately is an important goal for Model-Based Reinforcement
Learning (MBRL), but most MBRL methods learn a dense dynamics model which is …

Continuous mdp homomorphisms and homomorphic policy gradient

S Rezaei-Shoshtari, R Zhao… - Advances in …, 2022 - proceedings.neurips.cc
Abstraction has been widely studied as a way to improve the efficiency and generalization of
reinforcement learning algorithms. In this paper, we study abstraction in the continuous …