Adaptive and intelligent navigation of autonomous planetary rovers—A survey

C Wong, E Yang, XT Yan, D Gu - 2017 NASA/ESA Conference …, 2017 - ieeexplore.ieee.org
The application of robotics and autonomous systems in space has increased dramatically.
The ongoing Mars rover mission involving the Curiosity rover, along with the success of its …

An improved recurrent neural network for unmanned underwater vehicle online obstacle avoidance

C Lin, H Wang, J Yuan, D Yu, C Li - Ocean Engineering, 2019 - Elsevier
This paper focuses on online obstacle avoidance planning for unmanned underwater
vehicles. To improve the autonomous ability and intelligence of obstacle avoidance …

Hierarchical reinforcement learning for autonomous decision making and motion planning of intelligent vehicles

Y Lu, X Xu, X Zhang, L Qian, X Zhou - IEEE Access, 2020 - ieeexplore.ieee.org
Autonomous decision making and motion planning in complex dynamic traffic environments,
such as left-turn without traffic signals and multi-lane merging from side-ways, are still …

Improved dyna-Q: a reinforcement learning method focused via heuristic graph for AGV path planning in dynamic environments

Y Liu, S Yan, Y Zhao, C Song, F Li - Drones, 2022 - mdpi.com
Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in
large complex dynamic environments, due to the sparse reward function of Dyna-Q and the …

Online Model-Free n-Step HDP With Stability Analysis

S Al-Dabooni, DC Wunsch - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org
Because of a powerful temporal-difference (TD) with λ [TD (λ)] learning method, this paper
presents a novel n-step adaptive dynamic programming (ADP) architecture that combines …

An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning

S Al-Dabooni, DC Wunsch - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org
In problems with complex dynamics and challenging state spaces, the dual heuristic
programming (DHP) algorithm has been shown theoretically and experimentally to perform …

Multi-objective grasshopper optimization algorithm for robot path planning in static environments

Z Elmi, MÖ Efe - 2018 IEEE International Conference on …, 2018 - ieeexplore.ieee.org
Finding the most appropriate path in robot navigation has been an interesting challenge in
recent years. A number of different techniques have been proposed to address this problem …

Speeding-up action learning in a social robot with dyna-q+: a bioinspired probabilistic model approach

M Maroto-Gómez, R González… - IEEE …, 2021 - ieeexplore.ieee.org
Robotic systems that are developed for social and dynamic environments require adaptive
mechanisms to successfully operate. Consequently, learning from rewards has provided …

The Boundedness Conditions for Model-Free HDP( )

S Al-Dabooni, D Wunsch - IEEE Transactions on Neural …, 2018 - ieeexplore.ieee.org
This paper provides the stability analysis for a model-free action-dependent heuristic
dynamic programing (HDP) approach with an eligibility trace long-term prediction parameter …

Research on UUV obstacle avoiding method based on recurrent neural networks

C Lin, H Wang, J Yuan, D Yu, C Li - Complexity, 2019 - Wiley Online Library
In this paper, we present an online obstacle avoidance planning method for unmanned
underwater vehicle (UUV) based on clockwork recurrent neural network (CW‐RNN) and …