Proto-value functions: Developmental reinforcement learning

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com

Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

被引用次数：81 相关文章所有 8 个版本

[PDF] romisatriawahono.net

Reinforcement learning algorithms with function approximation: Recent advances and applications

X Xu, L Zuo, Z Huang - Information sciences, 2014 - Elsevier

In recent years, the research on reinforcement learning (RL) has focused on function
approximation in learning prediction and control of Markov decision processes (MDPs). The …

被引用次数：217 相关文章所有 6 个版本

[PDF] mlr.press

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press

Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

被引用次数：309 相关文章所有 9 个版本

[PDF] arxiv.org

A survey of exploration methods in reinforcement learning

S Amin, M Gomrokchi, H Satija, H Van Hoof… - arXiv preprint arXiv …, 2021 - arxiv.org

Exploration is an essential component of reinforcement learning algorithms, where agents
need to learn how to predict and control unknown and often stochastic environments …

被引用次数：103 相关文章所有 2 个版本

[PDF] mlr.press

Understanding self-predictive learning for reinforcement learning

Y Tang, ZD Guo, PH Richemond… - International …, 2023 - proceedings.mlr.press

We study the learning dynamics of self-predictive learning for reinforcement learning, a
family of algorithms that learn representations by minimizing the prediction error of their own …

被引用次数：28 相关文章所有 7 个版本

[PDF] aaai.org

Value function approximation in reinforcement learning using the Fourier basis

G Konidaris, S Osentoski, P Thomas - Proceedings of the AAAI …, 2011 - ojs.aaai.org

We describe the Fourier basis, a linear value function approximation scheme based on the
Fourier series. We empirically demonstrate that it performs well compared to radial basis …

被引用次数：549 相关文章所有 31 个版本

[PDF] arxiv.org

Eigenoption discovery through the deep successor representation

MC Machado, C Rosenbaum, X Guo, M Liu… - arXiv preprint arXiv …, 2017 - arxiv.org

Options in reinforcement learning allow agents to hierarchically decompose a task into
subtasks, having the potential to speed up learning and planning. However, autonomously …

被引用次数：172 相关文章所有 5 个版本

[PDF] jmlr.org

Temporal abstraction in reinforcement learning with the successor representation

MC Machado, A Barreto, D Precup… - Journal of Machine …, 2023 - jmlr.org

Reasoning at multiple levels of temporal abstraction is one of the key attributes of
intelligence. In reinforcement learning, this is often modeled through temporally extended …

被引用次数：48 相关文章所有 3 个版本

[PDF] jmlr.org

[PDF][PDF] Proto-value Functions: A Laplacian Framework for Learning Representation and Control in Markov Decision Processes.

S Mahadevan, M Maggioni - Journal of Machine Learning Research, 2007 - jmlr.org

This paper introduces a novel spectral framework for solving Markov decision processes
(MDPs) by jointly learning representations and optimal policies. The major components of …

被引用次数：399 相关文章所有 20 个版本

[PDF] umass.edu

Manifold alignment using procrustes analysis

C Wang, S Mahadevan - … of the 25th international conference on …, 2008 - dl.acm.org

In this paper we introduce a novel approach to manifold alignment, based on Procrustes
analysis. Our approach differs from" semi-supervised alignment" in that it results in a …

被引用次数：349 相关文章所有 18 个版本

高级搜索

QQ 群