Heuristic dynamic programming for mobile robot path planning based on Dyna approach

C Wong, E Yang, XT Yan, D Gu - 2017 NASA/ESA Conference …, 2017 - ieeexplore.ieee.org

The application of robotics and autonomous systems in space has increased dramatically.
The ongoing Mars rover mission involving the Curiosity rover, along with the success of its …

被引用次数：52 相关文章所有 8 个版本

An improved recurrent neural network for unmanned underwater vehicle online obstacle avoidance

C Lin, H Wang, J Yuan, D Yu, C Li - Ocean Engineering, 2019 - Elsevier

This paper focuses on online obstacle avoidance planning for unmanned underwater
vehicles. To improve the autonomous ability and intelligence of obstacle avoidance …

被引用次数：63 相关文章所有 3 个版本

[PDF] ieee.org

Hierarchical reinforcement learning for autonomous decision making and motion planning of intelligent vehicles

Y Lu, X Xu, X Zhang, L Qian, X Zhou - IEEE Access, 2020 - ieeexplore.ieee.org

Autonomous decision making and motion planning in complex dynamic traffic environments,
such as left-turn without traffic signals and multi-lane merging from side-ways, are still …

被引用次数：42 相关文章所有 2 个版本

[PDF] mdpi.com

Improved dyna-Q: a reinforcement learning method focused via heuristic graph for AGV path planning in dynamic environments

Y Liu, S Yan, Y Zhao, C Song, F Li - Drones, 2022 - mdpi.com

Dyna-Q is a reinforcement learning method widely used in AGV path planning. However, in
large complex dynamic environments, due to the sparse reward function of Dyna-Q and the …

被引用次数：14 相关文章所有 3 个版本

[PDF] ieee.org

Online Model-Free n-Step HDP With Stability Analysis

S Al-Dabooni, DC Wunsch - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org

Because of a powerful temporal-difference (TD) with λ [TD (λ)] learning method, this paper
presents a novel n-step adaptive dynamic programming (ADP) architecture that combines …

被引用次数：30 相关文章所有 5 个版本

[PDF] ieee.org

An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning

S Al-Dabooni, DC Wunsch - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org

In problems with complex dynamics and challenging state spaces, the dual heuristic
programming (DHP) algorithm has been shown theoretically and experimentally to perform …

被引用次数：36 相关文章所有 5 个版本

[PDF] academia.edu

Multi-objective grasshopper optimization algorithm for robot path planning in static environments

Z Elmi, MÖ Efe - 2018 IEEE International Conference on …, 2018 - ieeexplore.ieee.org

Finding the most appropriate path in robot navigation has been an interesting challenge in
recent years. A number of different techniques have been proposed to address this problem …

被引用次数：32 相关文章所有 4 个版本

[PDF] ieee.org

Speeding-up action learning in a social robot with dyna-q+: a bioinspired probabilistic model approach

M Maroto-Gómez, R González… - IEEE …, 2021 - ieeexplore.ieee.org

Robotic systems that are developed for social and dynamic environments require adaptive
mechanisms to successfully operate. Consequently, learning from rewards has provided …

被引用次数：12 相关文章所有 3 个版本

The Boundedness Conditions for Model-Free HDP( )

S Al-Dabooni, D Wunsch - IEEE Transactions on Neural …, 2018 - ieeexplore.ieee.org

This paper provides the stability analysis for a model-free action-dependent heuristic
dynamic programing (HDP) approach with an eligibility trace long-term prediction parameter …

被引用次数：19 相关文章所有 4 个版本

[PDF] wiley.com Full View

Research on UUV obstacle avoiding method based on recurrent neural networks

C Lin, H Wang, J Yuan, D Yu, C Li - Complexity, 2019 - Wiley Online Library

In this paper, we present an online obstacle avoidance planning method for unmanned
underwater vehicle (UUV) based on clockwork recurrent neural network (CW‐RNN) and …

被引用次数：13 相关文章所有 10 个版本

高级搜索

QQ 群