J Chen, T Lan, V Aggarwal - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Hierarchical Imitation Learning (HIL) has been proposed to recover highly-complex behaviors in long-horizon tasks from expert demonstrations by modeling the task hierarchy …
J Chen, T Lan, V Aggarwal - IEEE Transactions on Neural …, 2023 - ieeexplore.ieee.org
Imitation learning (IL) has been proposed to recover the expert policy from demonstrations. However, it would be difficult to learn a single monolithic policy for highly complex long …
G Cheng, L Dong, W Cai, C Sun - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Multi-task learning is an important problem in reinforcement learning. Training multiple tasks together brings benefits from the shared useful information across different tasks and often …
M Jing, W Huang, F Sun, X Ma… - International …, 2021 - proceedings.mlr.press
It has been a challenge to learning skills for an agent from long-horizon unannotated demonstrations. Existing approaches like Hierarchical Imitation Learning (HIL) are prone to …
G Xiang, J Su - IEEE transactions on cybernetics, 2019 - ieeexplore.ieee.org
Reinforcement learning (RL) and imitation learning (IL), especially equipped with deep neural networks, have been widely studied for autonomous robotic skill acquisition and …
J Meng, F Zhu - Expert Systems with Applications, 2023 - Elsevier
Multi-task reinforcement learning is promising to alleviate the low sample efficiency and high computation cost of reinforcement learning algorithms. However, current methods mostly …
Imitation learning offers a promising path for robots to learn general-purpose behaviors, but traditionally has exhibited limited scalability due to high data supervision requirements and …
Meta-reinforcement learning algorithms can enable robots to acquire new skills much more quickly, by leveraging prior experience to learn how to learn. However, much of the current …
A Correia, LA Alexandre - 2023 IEEE/RSJ International …, 2023 - ieeexplore.ieee.org
Sequence models in reinforcement learning require task knowledge to estimate the task policy. This paper presents the hierarchical decision transformer (HDT). HDT is a …