- 学术资源搜索

Hierarchical reinforcement learning: A comprehensive survey

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org

Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

被引用次数：435 相关文章所有 5 个版本

[PDF] arxiv.org

Intelligent problem-solving as integrated hierarchical reinforcement learning

M Eppe, C Gumbsch, M Kerzel, PDH Nguyen… - Nature Machine …, 2022 - nature.com

According to cognitive psychology and related disciplines, the development of complex
problem-solving behaviour in biological agents depends on hierarchical cognitive …

被引用次数：90 相关文章所有 9 个版本

[PDF] sciencedirect.com

Disaster City Digital Twin: A vision for integrating artificial and human intelligence for disaster management

C Fan, C Zhang, A Yahja, A Mostafavi - International journal of information …, 2021 - Elsevier

This paper presents a vision for a Disaster City Digital Twin paradigm that can:(i) enable
interdisciplinary convergence in the field of crisis informatics and information and …

被引用次数：399 相关文章所有 6 个版本

[PDF] arxiv.org

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer

Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

被引用次数：434 相关文章所有 9 个版本

[PDF] neurips.cc

Learning to synthesize programs as interpretable and generalizable policies

D Trivedi, J Zhang, SH Sun… - Advances in neural …, 2021 - proceedings.neurips.cc

Recently, deep reinforcement learning (DRL) methods have achieved impressive
performance on tasks in a variety of domains. However, neural network policies produced …

被引用次数：75 相关文章所有 7 个版本

[PDF] mlr.press

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press

We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

被引用次数：89 相关文章所有 9 个版本

[PDF] arxiv.org

Hierarchical reinforcement learning by discovering intrinsic options

J Zhang, H Yu, W Xu - arXiv preprint arXiv:2101.06521, 2021 - arxiv.org

We propose a hierarchical reinforcement learning method, HIDIO, that can learn task-
agnostic options in a self-supervised manner while jointly learning to utilize them to solve …

被引用次数：91 相关文章所有 3 个版本

[PDF] neurips.cc

Environment generation for zero-shot compositional reinforcement learning

I Gur, N Jaques, Y Miao, J Choi… - Advances in …, 2021 - proceedings.neurips.cc

Many real-world problems are compositional–solving them requires completing
interdependent sub-tasks, either in series or in parallel, that can be represented as a …

被引用次数：48 相关文章所有 8 个版本

[PDF] arxiv.org

Does zero-shot reinforcement learning exist?

A Touati, J Rapin, Y Ollivier - arXiv preprint arXiv:2209.14935, 2022 - arxiv.org

A zero-shot RL agent is an agent that can solve any RL task in a given environment,
instantly with no additional planning or learning, after an initial reward-free learning phase …

被引用次数：34 相关文章所有 4 个版本

[PDF] neurips.cc

Proto: Program-guided transformer for program-guided tasks

Z Zhao, K Samel, B Chen - Advances in neural …, 2021 - proceedings.neurips.cc

Programs, consisting of semantic and structural information, play an important role in the
communication between humans and agents. Towards learning general program executors …

被引用次数：39 相关文章所有 7 个版本

高级搜索

QQ 群

Hierarchical reinforcement learning: A comprehensive survey

Intelligent problem-solving as integrated hierarchical reinforcement learning

Disaster City Digital Twin: A vision for integrating artificial and human intelligence for disaster management

Deep reinforcement learning

Learning to synthesize programs as interpretable and generalizable policies

Ltl2action: Generalizing ltl instructions for multi-task rl

Hierarchical reinforcement learning by discovering intrinsic options

Environment generation for zero-shot compositional reinforcement learning

Does zero-shot reinforcement learning exist?

Proto: Program-guided transformer for program-guided tasks

引用