Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com
Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

Reinforcement learning algorithms with function approximation: Recent advances and applications

X Xu, L Zuo, Z Huang - Information sciences, 2014 - Elsevier
In recent years, the research on reinforcement learning (RL) has focused on function
approximation in learning prediction and control of Markov decision processes (MDPs). The …

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press
Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

A survey of exploration methods in reinforcement learning

S Amin, M Gomrokchi, H Satija, H Van Hoof… - arXiv preprint arXiv …, 2021 - arxiv.org
Exploration is an essential component of reinforcement learning algorithms, where agents
need to learn how to predict and control unknown and often stochastic environments …

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press
We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

Value function approximation in reinforcement learning using the Fourier basis

G Konidaris, S Osentoski, P Thomas - Proceedings of the AAAI …, 2011 - ojs.aaai.org
We describe the Fourier basis, a linear value function approximation scheme based on the
Fourier series. We empirically demonstrate that it performs well compared to radial basis …

Eigenoption discovery through the deep successor representation

MC Machado, C Rosenbaum, X Guo, M Liu… - arXiv preprint arXiv …, 2017 - arxiv.org
Options in reinforcement learning allow agents to hierarchically decompose a task into
subtasks, having the potential to speed up learning and planning. However, autonomously …

Temporal abstraction in reinforcement learning with the successor representation

MC Machado, A Barreto, D Precup… - Journal of Machine …, 2023 - jmlr.org
Reasoning at multiple levels of temporal abstraction is one of the key attributes of
intelligence. In reinforcement learning, this is often modeled through temporally extended …

[PDF][PDF] Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.

S Mahadevan, M Maggioni - Journal of Machine Learning Research, 2007 - jmlr.org
This paper introduces a novel spectral framework for solving Markov decision processes
(MDPs) by jointly learning representations and optimal policies. The major components of …

Manifold alignment using procrustes analysis

C Wang, S Mahadevan - … of the 25th international conference on …, 2008 - dl.acm.org
In this paper we introduce a novel approach to manifold alignment, based on Procrustes
analysis. Our approach differs from" semi-supervised alignment" in that it results in a …