Recent progress in reinforcement learning and adaptive dynamic programming for advanced control applications

D Wang, N Gao, D Liu, J Li… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) has roots in dynamic programming and it is called
adaptive/approximate dynamic programming (ADP) within the control community. This paper …

Adaptive multi-step evaluation design with stability guarantee for discrete-time optimal learning control

D Wang, J Wang, M Zhao, P Xin… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
This paper is concerned with a novel integrated multi-step heuristic dynamic programming
(MsHDP) algorithm for solving optimal control problems. It is shown that, initialized by the …

Offline and online adaptive critic control designs with stability guarantee through value iteration

M Ha, D Wang, D Liu - IEEE Transactions on Cybernetics, 2021 - ieeexplore.ieee.org
This article is concerned with the stability of the closed-loop system using various control
policies generated by value iteration. Some stability properties involving admissibility …

Discrete-time self-learning parallel control

Q Wei, L Wang, J Lu, FY Wang - IEEE Transactions on Systems …, 2020 - ieeexplore.ieee.org
In this article, a new self-learning parallel control method, which is based on adaptive
dynamic programming (ADP) technique, is developed for solving the optimal control …

A Novel Online Adaptive Dynamic Programming Algorithm With Adjustable Convergence Rate

Y Wang, Z Zhang, Y Zhang, M Liang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This article develops a novel online adaptive dynamic programming algorithm with
adjustable convergence rate to address the optimal control problem of nonlinear systems …

Supplementary heuristic dynamic programming for wastewater treatment process control

D Wang, X Li, P Xin, A Liu, J Qiao - Expert Systems with Applications, 2024 - Elsevier
With the rapid development of industry, the amount of wastewater discharge is increasing. In
order to improve the efficiency of the wastewater treatment process (WWTP), we often desire …

An Improved N-Step Value Gradient Learning Adaptive Dynamic Programming Algorithm for Online Learning

S Al-Dabooni, DC Wunsch - IEEE Transactions on Neural …, 2019 - ieeexplore.ieee.org
In problems with complex dynamics and challenging state spaces, the dual heuristic
programming (DHP) algorithm has been shown theoretically and experimentally to perform …

Adaptive critic control with multi‐step policy evaluation for nonlinear zero‐sum games

X Li, D Wang, J Wang, J Qiao - International Journal of Robust …, 2024 - Wiley Online Library
To attenuate the effect of disturbances on control performance, a multi‐step adaptive critic
control (MsACC) framework is developed to solve zero‐sum games for discrete‐time …

Synchronous fault-tolerant near-optimal control for discrete-time nonlinear PE game

Y Yuan, P Zhang, X Li - IEEE Transactions on Neural Networks …, 2020 - ieeexplore.ieee.org
In this article, the synchronous fault-tolerant near-optimal control strategy design problem is
studied for a class of discrete-time nonlinear pursuit-evasion (PE) games. In the studied PE …

Efficient online globalized dual heuristic programming with an associated dual network

Y Zhou - IEEE Transactions on Neural Networks and Learning …, 2022 - ieeexplore.ieee.org
Globalized dual heuristic programming (GDHP) is the most comprehensive adaptive critic
design, which employs its critic to minimize the error with respect to both the cost-to-go and …