Deep deterministic policy gradient-based autonomous driving for mobile robots in sparse reward environments

Authors

Minjae Park, Seok Young Lee, Jin Seok Hong, Nam Kyu Kwon

Publication date

2022/12/7

Journal

Sensors

Volume

Issue

Pages

9574

Publisher

MDPI

Description

In this paper, we propose a deep deterministic policy gradient (DDPG)-based path-planning method for mobile robots that applies the hindsight experience replay (HER) technique to overcome the performance degradation caused by sparse rewards in autonomous driving. The mobile robot in our analysis was a TurtleBot3 running the Robot Operating System (ROS), and the experimental environment was a virtual simulation based on Gazebo. A fully connected neural network was used as the DDPG network based on the actor–critic architecture, and noise was added to the actor network. The robot recognized an unknown environment by measuring distances using a laser sensor and determined the optimized policy to reach its destination. The HER technique improved the learning performance by generating three new episodes with normal experience from a failed episode. The results demonstrated that the HER technique could help mitigate the sparse reward problem; this was further corroborated by the successful autonomous driving results obtained after applying the proposed method to two reward systems, as well as by actual experimental results.
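
The HER step described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the three new episodes are built, so everything specific here is an assumption: the `Transition` layout, the 0/-1 `sparse_reward` function, the 0.2 tolerance, and the choice to sample substitute goals from positions the robot actually reached are all hypothetical, not the authors' implementation.

```python
import math
import random
from typing import List, NamedTuple, Optional, Tuple

Point = Tuple[float, float]


class Transition(NamedTuple):
    position: Point       # robot position before the action
    action: Point         # (linear, angular) velocity command, illustrative only
    next_position: Point  # robot position after the action
    goal: Point           # goal this transition is scored against
    reward: float
    done: bool


def sparse_reward(position: Point, goal: Point, tol: float = 0.2) -> float:
    """Hypothetical sparse reward: 0 when within tol of the goal, else -1."""
    return 0.0 if math.dist(position, goal) <= tol else -1.0


def her_relabel(episode: List[Transition], n_goals: int = 3, tol: float = 0.2,
                rng: Optional[random.Random] = None) -> List[List[Transition]]:
    """Build up to n_goals relabeled episodes from one (typically failed) episode.

    Assumption: each substitute goal is a position the robot actually reached.
    """
    rng = rng or random.Random()
    goal_steps = rng.sample(range(len(episode)), min(n_goals, len(episode)))
    relabeled = []
    for g in goal_steps:
        new_goal = episode[g].next_position
        new_episode = []
        # Replay the episode up to step g, scoring it against the new goal.
        for t in episode[: g + 1]:
            r = sparse_reward(t.next_position, new_goal, tol)
            done = r == 0.0
            new_episode.append(t._replace(goal=new_goal, reward=r, done=done))
            if done:
                break  # the substitute goal has been reached
        relabeled.append(new_episode)
    return relabeled


# A failed five-step episode: the robot drives along x and never reaches (5, 5).
original_goal = (5.0, 5.0)
failed = []
for i in range(5):
    pos, nxt = (0.5 * i, 0.0), (0.5 * (i + 1), 0.0)
    failed.append(Transition(pos, (0.15, 0.0), nxt, original_goal,
                             sparse_reward(nxt, original_goal), False))

# Three extra episodes, each ending in a success for its substitute goal.
extra = her_relabel(failed, n_goals=3, rng=random.Random(0))
```

In a DDPG training loop, both `failed` and the transitions in `extra` would be pushed into the replay buffer, so the critic sees some non-negative rewards even when the robot never reaches the original destination.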

Total citations

Cited by 11

2023: 3, 2024: 8
