[HTML][HTML] A multi-stage deep reinforcement learning with search-based optimization for air–ground unmanned system navigation

X Chen, Y Qi, Y Yin, Y Chen, L Liu, H Chen - Applied Sciences, 2023 - mdpi.com
An important challenge for air–ground unmanned systems achieving autonomy is
navigation, which is essential for them to accomplish various tasks in unknown …

[HTML][HTML] Improved dyna-Q: a reinforcement learning method focused via heuristic graph for AGV path planning in dynamic environments

Y Liu, S Yan, Y Zhao, C Song, F Li - Drones, 2022 - mdpi.com
Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in
large complex dynamic environments, due to the sparse reward function of Dyna-Q and the …

Karting racing: A revisit to PPO and SAC algorithm

C Xu, R Zhu, D Yang - 2021 International Conference on …, 2021 - ieeexplore.ieee.org
Proximal Policy Optimization (PPO) is a classical algorithm in reinforcement learning, which
has been tested in a collection of benchmark tasks. In this paper, we test PPO in Unity …

Reinforcement learning architecture for cyber–physical–social AI: state-of-the-art and perspectives

X Li, P Wang, X Jin, Q Jiang, W Zhou, S Yao - Artificial Intelligence Review, 2023 - Springer
As the extension of cyber–physical systems (CPSs), cyber–physical–social systems
(CPSSs) seamlessly integrate cyber space, physical space, and social space. CPSS provide …

A maximum divergence approach to optimal policy in deep reinforcement learning

Z Yang, H Qu, M Fu, W Hu… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Model-free reinforcement learning algorithms based on entropy regularized have achieved
good performance in control tasks. Those algorithms consider using the entropy-regularized …

[HTML][HTML] Deep reinforcement learning navigation via decision transformer in autonomous driving

L Ge, X Zhou, Y Li, Y Wang - Frontiers in Neurorobotics, 2024 - frontiersin.org
In real-world scenarios, making navigation decisions for autonomous driving involves a
sequential set of steps. These judgments are made based on partial observations of the …

Asynchronous curriculum experience replay: A deep reinforcement learning approach for UAV autonomous motion control in unknown dynamic environments

Z Hu, X Gao, K Wan, Q Wang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Unmanned aerial vehicles (UAVs) have been widely used in military warfare, and realizing
safely autonomous motion control (AMC) in complex unknown environments is a challenge …

Optimizing Reinforcement Learning Control Model In Furuta Pendulum And Transferring It to Real-World (JULY 2023)

MR Hong, S Kang, JG Lee, S Seo, S Han, JS Koh… - IEEE …, 2023 - ieeexplore.ieee.org
Reinforcement learning does not require explicit robot modeling as it learns on its own
based on data, but it has temporal and spatial constraints when transferred to real-world …

HGRL: Human-Driving-Data Guided Reinforcement Learning for Autonomous Driving

H Zhuang, H Chu, Y Wang, B Gao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Reinforcement learning (RL) shows promise for autonomous driving decision-making.
However, designing appropriate reward functions to guide RL agents towards complex …

MPC-based reinforcement learning for a simplified freight mission of autonomous surface vehicles

W Cai, AB Kordabad, HN Esfahani… - 2021 60th IEEE …, 2021 - ieeexplore.ieee.org
In this work, we propose a Model Predictive Control (MPC)-based Reinforcement Learning
(RL) method for Autonomous Surface Vehicles (ASVs). The objective is to find an optimal …