Advanced value iteration for discrete-time intelligent critic control: A survey

M Zhao, D Wang, J Qiao, M Ha, J Ren - Artificial Intelligence Review, 2023 - Springer
Optimal control problems are ubiquitous in practical engineering applications and social life
with the idea of cost or resource conservation. Based on the critic learning scheme, adaptive …

Offline and online adaptive critic control designs with stability guarantee through value iteration

M Ha, D Wang, D Liu - IEEE Transactions on Cybernetics, 2021 - ieeexplore.ieee.org
This article is concerned with the stability of the closed-loop system using various control
policies generated by value iteration. Some stability properties involving admissibility …

Evolving and incremental value iteration schemes for nonlinear discrete-time zero-sum games

M Zhao, D Wang, M Ha, J Qiao - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
In this article, evolving and incremental value iteration (VI) frameworks are constructed to
address the discrete-time zero-sum game problem. First, the evolving scheme means that …

Deep deterministic policy gradient with compatible critic network

D Wang, M Hu - IEEE Transactions on Neural Networks and …, 2021 - ieeexplore.ieee.org
Deep deterministic policy gradient (DDPG) is a powerful reinforcement learning algorithm for
large-scale continuous controls. DDPG runs the back-propagation from the state-action …

Self-organizing radial basis function neural network using accelerated second-order learning algorithm

HG Han, ML Ma, HY Yang, JF Qiao - Neurocomputing, 2022 - Elsevier
Gradient-based algorithms are commonly used for training radial basis function neural
network (RBFNN). However, it is still difficult to avoid vanishing gradient to improve the …

Stability and admissibility analysis for zero-sum games under general value iteration formulation

D Wang, M Zhao, M Ha, J Qiao - IEEE Transactions on Neural …, 2022 - ieeexplore.ieee.org
In this article, the general value iteration (GVI) algorithm for discrete-time zero-sum games is
investigated. The theoretical analysis focuses on stability properties of the systems and also …

Optimal evolution strategy for continuous strategy games on complex networks via reinforcement learning

L Fan, D Yu, KH Cheong, Z Wang - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
This article presents an optimal evolution strategy for continuous strategy games on complex
networks via reinforcement learning (RL). In the past, evolutionary game theory usually …

Event-triggered adaptive fuzzy optimal control for a class of strict-feedback nonlinear systems with external disturbances

W Zhang, J Yan, N Duan - IEEE Transactions on Fuzzy …, 2023 - ieeexplore.ieee.org
In this article, the problem of the event-triggered optimal control for a class of strict-feedback
nonlinear systems with external disturbances is investigated. First, by introducing proper low …

Decentralized control for large-scale systems with actuator faults and external disturbances: a data-driven method

Y Li, H Zhang, Z Wang, C Huang… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
This article investigates optimal control for a class of large-scale systems using a data-driven
method. The existing control methods for large-scale systems in this context separately …

Optimal trajectory tracking control for a class of nonlinear nonaffine systems via generalized N‐step value gradient learning

M Zhao, D Wang, J Qiao, L Hu - International Journal of Robust …, 2023 - Wiley Online Library
In this paper, the tracking control problem of unknown nonlinear systems is solved by using
the generalized N‐step value gradient learning algorithm with parameter λ λ GNSVGL (λ λ) …