Q-cut—dynamic discovery of sub-goals in reinforcement learning

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org

Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

被引用次数：319 相关文章所有 5 个版本

[PDF] academia.edu

Deep learning in neural networks: An overview

J Schmidhuber - Neural networks, 2015 - Elsevier

In recent years, deep artificial neural networks (including recurrent ones) have won
numerous contests in pattern recognition and machine learning. This historical survey …

被引用次数：22447 相关文章所有 42 个版本

[PDF] nowpublishers.com

Model-based reinforcement learning: A survey

TM Moerland, J Broekens, A Plaat… - … and Trends® in …, 2023 - nowpublishers.com

Sequential decision making, commonly formalized as Markov Decision Process (MDP)
optimization, is an important challenge in artificial intelligence. Two key approaches to this …

被引用次数：692 相关文章所有 17 个版本

[PDF] neurips.cc

Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation

TD Kulkarni, K Narasimhan, A Saeedi… - Advances in neural …, 2016 - proceedings.neurips.cc

Learning goal-directed behavior in environments with sparse feedback is a major challenge
for reinforcement learning algorithms. One of the key difficulties is insufficient exploration …

被引用次数：1403 相关文章所有 9 个版本

[PDF] aaai.org

The option-critic architecture

PL Bacon, J Harb, D Precup - Proceedings of the AAAI conference on …, 2017 - ojs.aaai.org

Temporal abstraction is key to scaling up learning and planning in reinforcement learning.
While planning with temporally extended actions is well understood, creating such …

被引用次数：1244 相关文章所有 14 个版本

[PDF] mlr.press

Modular multitask reinforcement learning with policy sketches

J Andreas, D Klein, S Levine - International conference on …, 2017 - proceedings.mlr.press

We describe a framework for multitask deep reinforcement learning guided by policy
sketches. Sketches annotate tasks with sequences of named subtasks, providing information …

被引用次数：544 相关文章所有 11 个版本

[PDF] jair.org

From skills to symbols: Learning symbolic representations for abstract high-level planning

G Konidaris, LP Kaelbling, T Lozano-Perez - Journal of Artificial Intelligence …, 2018 - jair.org

We consider the problem of constructing abstract representations for planning in high-
dimensional, continuous environments. We assume an agent equipped with a collection of …

被引用次数：339 相关文章所有 8 个版本

[PDF] mdpi.com

Hierarchical reinforcement learning: A survey and open research challenges

M Hutsebaut-Buysse, K Mets, S Latré - Machine Learning and Knowledge …, 2022 - mdpi.com

Reinforcement learning (RL) allows an agent to solve sequential decision-making problems
by interacting with an environment in a trial-and-error fashion. When these environments are …

被引用次数：74 相关文章所有 8 个版本

[PDF] arxiv.org

Learning multi-level hierarchies with hindsight

A Levy, G Konidaris, R Platt, K Saenko - arXiv preprint arXiv:1712.00948, 2017 - arxiv.org

Hierarchical agents have the potential to solve sequential decision making tasks with
greater sample efficiency than their non-hierarchical counterparts because hierarchical …

被引用次数：303 相关文章所有 11 个版本

[PDF] mlr.press

A laplacian framework for option discovery in reinforcement learning

MC Machado, MG Bellemare… - … on Machine Learning, 2017 - proceedings.mlr.press

Abstract Representation learning and option discovery are two of the biggest challenges in
reinforcement learning (RL). Proto-value functions (PVFs) are a well-known approach for …

被引用次数：306 相关文章所有 9 个版本

高级搜索

QQ 群