Planning-augmented hierarchical reinforcement learning

R Gieselmann, FT Pokorny - IEEE Robotics and Automation …, 2021 - ieeexplore.ieee.org
IEEE Robotics and Automation Letters, 2021
Planning algorithms are powerful at solving long-horizon decision-making problems but require that the environment dynamics are known. Model-free reinforcement learning has recently been merged with graph-based planning to increase the robustness of trained policies in state-space navigation problems. Recent work suggests using planning to provide intermediate waypoints that guide the policy in long-horizon tasks. Yet it is not always practical to describe a problem as state-to-state navigation. Often, the goal is defined by one or more disjoint sets of valid states, or implicitly through an abstract task description. Building upon previous efforts, we introduce a novel algorithm called Planning-Augmented Hierarchical Reinforcement Learning (PAHRL), which extends the concept of hybrid planning/RL to such problems with implicitly defined goals. Using a hierarchical framework, we divide the original task, formulated as a Markov Decision Process (MDP), into a hierarchy of shorter-horizon MDPs. Actor-critic agents are trained in parallel for each level of the hierarchy. During testing, a planner then determines useful subgoals on a state graph constructed at the bottom level of the hierarchy. The effectiveness of our approach is demonstrated on a set of continuous control problems in simulation, including robot arm reaching tasks and the manipulation of a deformable object.
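The test-time planning step described in the abstract can be illustrated with a minimal sketch. This is not the PAHRL implementation: the graph layout, the edge costs, and the names `plan_subgoals`, `graph`, and `goal_nodes` are all hypothetical, and plain Dijkstra search stands in for whatever planner the authors use. It only shows how a goal given as a set of valid states, rather than a single target state, can still yield a sequence of subgoals on a state graph.

```python
import heapq

def plan_subgoals(graph, start, goal_nodes):
    """Return the cheapest node sequence from start to ANY node in goal_nodes.

    graph: dict mapping node -> {neighbor: cost}. Costs are assumed to be
    non-negative estimates of how hard it is for the low-level policy to
    travel between two states (hypothetical; not taken from the paper).
    Returns None when no goal node is reachable.
    """
    dist = {start: 0.0}
    parent = {start: None}
    frontier = [(0.0, start)]
    visited = set()
    while frontier:
        d, node = heapq.heappop(frontier)
        if node in visited:
            continue
        visited.add(node)
        if node in goal_nodes:
            # First goal node popped is the closest one; rebuild the path.
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nxt, cost in graph.get(node, {}).items():
            new_d = d + cost
            if new_d < dist.get(nxt, float("inf")):
                dist[nxt] = new_d
                parent[nxt] = node
                heapq.heappush(frontier, (new_d, nxt))
    return None

# Toy state graph with two disjoint goal states, "g1" and "g2".
graph = {
    "s": {"a": 1.0, "b": 4.0},
    "a": {"b": 1.0, "g1": 5.0},
    "b": {"g2": 1.0},
    "g1": {},
    "g2": {},
}
subgoals = plan_subgoals(graph, "s", {"g1", "g2"})
```

In this toy graph the search reaches `g2` through `a` and `b` at total cost 3.0, cheaper than reaching `g1` at cost 6.0, so the intermediate nodes would serve as waypoints for the low-level policy.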