Structure-Aware Policy to Improve Generalization among Various Robots and Environments

W Xu, Y Gao, B Nie - 2022 IEEE International Conference on …, 2022 - ieeexplore.ieee.org

2022 IEEE International Conference on Robotics and Biomimetics (ROBIO), 2022 • ieeexplore.ieee.org

Recently, Deep Reinforcement Learning (DRL) has been used to solve complex robot control tasks with outstanding success. However, previous DRL methods still have shortcomings, such as poor generalization, which makes policy performance quite sensitive to small variations in the task settings. Moreover, retraining a new policy from scratch for each new task is time-consuming and computationally expensive, which restricts the application of DRL-based methods in the real world. In this work, we propose a novel DRL generalization method called GNN-embedding, which incorporates the robot hardware and the environment simultaneously through a GNN-based policy network and learnable task embedding vectors. It can thus learn a unified policy for different robots under different environment conditions, which improves the generalization performance of existing DRL robot policies. Multiple experiments are conducted on the Hopper-v2 robot. The experimental results demonstrate the effectiveness and efficiency of GNN-embedding for generalization, including multi-task learning and transfer learning problems.
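The abstract gives no implementation details, so the following is only a rough sketch of the general idea it describes: a policy that passes messages over a graph of the robot's body parts and conditions every node on a learnable per-task embedding. It is not the authors' implementation. The chain graph, the layer sizes, the number of tasks, the mean-aggregation update, and the class name `GNNEmbeddingPolicy` are all assumptions made for illustration, and the weights are random rather than trained.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed kinematic chain for a Hopper-like robot: torso - thigh - leg - foot.
EDGES = [(0, 1), (1, 2), (2, 3)]
NUM_NODES = 4
OBS_DIM = 3    # per-node observation size (assumption)
EMB_DIM = 4    # task-embedding size (assumption)
HID_DIM = 16   # hidden size (assumption)
NUM_TASKS = 5  # number of robot/environment variants (assumption)


def normalized_adjacency(edges, n):
    """Undirected adjacency with self-loops, row-normalized (mean aggregation)."""
    a = np.eye(n)
    for i, j in edges:
        a[i, j] = a[j, i] = 1.0
    return a / a.sum(axis=1, keepdims=True)


class GNNEmbeddingPolicy:
    """Hypothetical structure-aware policy with one embedding vector per task."""

    def __init__(self):
        self.adj = normalized_adjacency(EDGES, NUM_NODES)
        # One row per task; in training these would be learned with the weights.
        self.task_emb = rng.normal(0.0, 0.1, (NUM_TASKS, EMB_DIM))
        self.w_in = rng.normal(0.0, 0.1, (OBS_DIM + EMB_DIM, HID_DIM))
        self.w_msg = rng.normal(0.0, 0.1, (HID_DIM, HID_DIM))
        self.w_out = rng.normal(0.0, 0.1, (HID_DIM, 1))

    def act(self, node_obs, task_id, rounds=2):
        # Attach the same task embedding to every node's observation.
        emb = np.broadcast_to(self.task_emb[task_id], (NUM_NODES, EMB_DIM))
        h = np.tanh(np.concatenate([node_obs, emb], axis=1) @ self.w_in)
        # Message passing: each node averages itself and its neighbours.
        for _ in range(rounds):
            h = np.tanh(self.adj @ h @ self.w_msg)
        # One bounded action per non-root node (thigh, leg, foot).
        return np.tanh(h @ self.w_out)[1:, 0]


policy = GNNEmbeddingPolicy()
obs = np.ones((NUM_NODES, OBS_DIM))
actions_task0 = policy.act(obs, task_id=0)
actions_task1 = policy.act(obs, task_id=1)
```

In this sketch the same weights serve every task and only the embedding row changes with `task_id`, which is one plausible reading of how a single unified policy could cover different robots and environment conditions.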


Cited by: 1
