Hierarchical reinforcement learning: A comprehensive survey

S Pateria, B Subagdja, A Tan, C Quek - ACM Computing Surveys (CSUR …, 2021 - dl.acm.org
Hierarchical Reinforcement Learning (HRL) enables autonomous decomposition of
challenging long-horizon decision-making tasks into simpler subtasks. During the past …

Intelligent problem-solving as integrated hierarchical reinforcement learning

M Eppe, C Gumbsch, M Kerzel, PDH Nguyen… - Nature Machine …, 2022 - nature.com
According to cognitive psychology and related disciplines, the development of complex
problem-solving behaviour in biological agents depends on hierarchical cognitive …

Disaster City Digital Twin: A vision for integrating artificial and human intelligence for disaster management

C Fan, C Zhang, A Yahja, A Mostafavi - International journal of information …, 2021 - Elsevier
This paper presents a vision for a Disaster City Digital Twin paradigm that can:(i) enable
interdisciplinary convergence in the field of crisis informatics and information and …

Deep reinforcement learning

SE Li - Reinforcement learning for sequential decision and …, 2023 - Springer
Similar to humans, RL agents use interactive learning to successfully obtain satisfactory
decision strategies. However, in many cases, it is desirable to learn directly from …

Learning to synthesize programs as interpretable and generalizable policies

D Trivedi, J Zhang, SH Sun… - Advances in neural …, 2021 - proceedings.neurips.cc
Recently, deep reinforcement learning (DRL) methods have achieved impressive
performance on tasks in a variety of domains. However, neural network policies produced …

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

Hierarchical reinforcement learning by discovering intrinsic options

J Zhang, H Yu, W Xu - arXiv preprint arXiv:2101.06521, 2021 - arxiv.org
We propose a hierarchical reinforcement learning method, HIDIO, that can learn task-
agnostic options in a self-supervised manner while jointly learning to utilize them to solve …

Environment generation for zero-shot compositional reinforcement learning

I Gur, N Jaques, Y Miao, J Choi… - Advances in …, 2021 - proceedings.neurips.cc
Many real-world problems are compositional–solving them requires completing
interdependent sub-tasks, either in series or in parallel, that can be represented as a …

Does zero-shot reinforcement learning exist?

A Touati, J Rapin, Y Ollivier - arXiv preprint arXiv:2209.14935, 2022 - arxiv.org
A zero-shot RL agent is an agent that can solve any RL task in a given environment,
instantly with no additional planning or learning, after an initial reward-free learning phase …

Proto: Program-guided transformer for program-guided tasks

Z Zhao, K Samel, B Chen - Advances in neural …, 2021 - proceedings.neurips.cc
Programs, consisting of semantic and structural information, play an important role in the
communication between humans and agents. Towards learning general program executors …