Eliminating Primacy Bias in Online Reinforcement Learning by Self-Distillation

J Li, H Shi, H Wu, C Zhao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Excessive invalid explorations at the beginning of training lead deep reinforcement learning
process to fall into the risk of overfitting, further resulting in spurious decisions, which …

Contrast, Imitate, Adapt: Learning Robotic Skills From Raw Human Videos

Z Qian, M You, H Zhou, X Xu, H Fu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Learning robotic skills from raw human videos remains a non-trivial challenge. Previous
works tackled this problem by leveraging behavior cloning or learning reward functions from …

[引用][C] Task-Oriented Self-Imitation Learning for Robotic Autonomous Skill Acquisition

C Ran, J Su - International Journal of Humanoid Robotics, 2024 - World Scientific
The inferior sample efficiency of reinforcement learning (RL) and the requirement for high-
quality demonstrations in imitation learning (IL) will hinder their application in real-world …