作者
Zengqi Peng, Xiao Zhou, Yubin Wang, Lei Zheng, Ming Liu, Jun Ma
发表日期
2023/9/24
研讨会论文
2023 IEEE 26th International Conference on Intelligent Transportation Systems (ITSC)
页码范围
5027-5033
出版商
IEEE
简介
Unsignalized intersections are typically considered as one of the most representative and challenging scenarios for self-driving vehicles. To tackle autonomous driving problems in such scenarios, this paper proposes a curriculum proximal policy optimization (CPPO) framework with stage-decaying clipping. By adjusting the clipping parameter during different stages of training through proximal policy optimization (PPO), the vehicle can first rapidly search for an approximate optimal policy or its neighborhood with a large parameter, and then converges to the optimal policy with a small one. Particularly, the stage-based curriculum learning technology is incorporated into the proposed framework to improve the generalization performance and further accelerate the training process. Moreover, the reward function is specially designed in view of different curriculum settings. A series of comparative experiments are …
引用总数
学术搜索中的文章
Z Peng, X Zhou, Y Wang, L Zheng, M Liu, J Ma - 2023 IEEE 26th International Conference on Intelligent …, 2023