Robust imitation learning from corrupted demonstrations

Z Li, T Xu, Z Qin, Y Yu, ZQ Luo - Advances in Neural …, 2024 - proceedings.neurips.cc

Imitation learning (IL) algorithms excel in acquiring high-quality policies from expert data for
sequential decision-making tasks. But, their effectiveness is hampered when faced with …

被引用次数：4 相关文章所有 3 个版本

[PDF] arxiv.org

Rank-based decomposable losses in machine learning: A survey

S Hu, X Wang, S Lyu - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Recent works have revealed an essential paradigm in designing loss functions that
differentiate individual losses versus aggregate losses. The individual loss measures the …

被引用次数：20 相关文章所有 8 个版本

[PDF] arxiv.org

Towards robust offline reinforcement learning under diverse data corruption

R Yang, H Zhong, J Xu, A Zhang, C Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

Offline reinforcement learning (RL) presents a promising approach for learning reinforced
policies from offline datasets without the need for costly or unsafe interactions with the …

被引用次数：7 相关文章所有 3 个版本

[PDF] aaai.org

Unlabeled imperfect demonstrations in adversarial imitation learning

Y Wang, B Du, C Xu - Proceedings of the AAAI Conference on Artificial …, 2023 - ojs.aaai.org

Adversarial imitation learning has become a widely used imitation learning framework. The
discriminator is often trained by taking expert demonstrations and policy trajectories as …

被引用次数：6 相关文章所有 4 个版本

[PDF] mlr.press

Detecting incorrect visual demonstrations for improved policy learning

M Hussein, M Begum - Conference on Robot Learning, 2023 - proceedings.mlr.press

Learning tasks only from raw video demonstrations is the current state of the art in robotics
visual imitation learning research. The implicit assumption here is that all video …

被引用次数：2 相关文章所有 3 个版本

[PDF] proquest.com

[图书][B] Robust Behavior Cloning for Multi-Step Sequential Task Learning by Robots

M Hussein - 2023 - search.proquest.com

This research is about learning high-level policies of multi-step sequential (MSS) tasks–such
as activities of daily living–from demonstrations in a sample efficient manner. This research …

[PDF][PDF] How to Improve Imitation Learning Performance with Sub-optimal Supplementary Data?

Z Li, T Xu, Z Qin, Y Yu, ZQ Luo - dmlr.ai

Imitation learning (IL) is a machine learning technique that involves learning from examples
provided by an expert. IL algorithms can solve the sequential decision-making tasks but their …

高级搜索

QQ 群