Imitation learning from imperfection: Theoretical justifications and algorithms

Z Li, T Xu, Z Qin, Y Yu, ZQ Luo - Advances in Neural …, 2024 - proceedings.neurips.cc
Imitation learning (IL) algorithms excel in acquiring high-quality policies from expert data for
sequential decision-making tasks. But, their effectiveness is hampered when faced with …

Rank-based decomposable losses in machine learning: A survey

S Hu, X Wang, S Lyu - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Recent works have revealed an essential paradigm in designing loss functions that
differentiate individual losses versus aggregate losses. The individual loss measures the …

Towards robust offline reinforcement learning under diverse data corruption

R Yang, H Zhong, J Xu, A Zhang, C Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Offline reinforcement learning (RL) presents a promising approach for learning reinforced
policies from offline datasets without the need for costly or unsafe interactions with the …

Unlabeled imperfect demonstrations in adversarial imitation learning

Y Wang, B Du, C Xu - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org
Adversarial imitation learning has become a widely used imitation learning framework. The
discriminator is often trained by taking expert demonstrations and policy trajectories as …

Detecting incorrect visual demonstrations for improved policy learning

M Hussein, M Begum - Conference on Robot Learning, 2023 - proceedings.mlr.press
Learning tasks only from raw video demonstrations is the current state of the art in robotics
visual imitation learning research. The implicit assumption here is that all video …

[图书][B] Robust Behavior Cloning for Multi-Step Sequential Task Learning by Robots

M Hussein - 2023 - search.proquest.com
This research is about learning high-level policies of multi-step sequential (MSS) tasks–such
as activities of daily living–from demonstrations in a sample efficient manner. This research …

[PDF][PDF] How to Improve Imitation Learning Performance with Sub-optimal Supplementary Data?

Z Li, T Xu, Z Qin, Y Yu, ZQ Luo - dmlr.ai
Imitation learning (IL) is a machine learning technique that involves learning from examples
provided by an expert. IL algorithms can solve the sequential decision-making tasks but their …