Recent progress in reinforcement learning and adaptive dynamic programming for advanced control applications

D Wang, N Gao, D Liu, J Li… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) has roots in dynamic programming and it is called
adaptive/approximate dynamic programming (ADP) within the control community. This paper …

Adaptive dynamic programming for networked control systems under communication constraints: A survey of trends and techniques

X Wang, Y Sun, D Ding - International Journal of Network Dynamics and …, 2022 - sciltp.com
The adaptive dynamic programming (ADP) technology has been widely used benefiting
from its recursive structure in forward and the prospective conception of reinforcement …

The intelligent critic framework for advanced optimal control

D Wang, M Ha, M Zhao - Artificial Intelligence Review, 2022 - Springer
The idea of optimization can be regarded as an important basis of many disciplines and
hence is extremely useful for a large number of research fields, particularly for artificial …

Cooperative finitely excited learning for dynamical games

Y Yang, H Modares, KG Vamvoudakis… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In this article, we propose a way to enhance the learning framework for zero-sum games
with dynamics evolving in continuous time. In contrast to the conventional centralized actor …

Dual event-triggered constrained control through adaptive critic for discrete-time zero-sum games

D Wang, L Hu, M Zhao, J Qiao - IEEE Transactions on Systems …, 2022 - ieeexplore.ieee.org
In this article, through adaptive critic, a dual event-triggered (DET) constrained control
scheme is established for discrete-time nonlinear zero-sum games. The neural networks are …

Adaptive neural-network boundary control for a flexible manipulator with input constraints and model uncertainties

Y Ren, Z Zhao, C Zhang, Q Yang… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
This article develops an adaptive neural-network (NN) boundary control scheme for a
flexible manipulator subject to input constraints, model uncertainties, and external …

Advanced value iteration for discrete-time intelligent critic control: A survey

M Zhao, D Wang, J Qiao, M Ha, J Ren - Artificial Intelligence Review, 2023 - Springer
Optimal control problems are ubiquitous in practical engineering applications and social life
with the idea of cost or resource conservation. Based on the critic learning scheme, adaptive …

Neuroadaptive control for complicated underactuated systems with simultaneous output and velocity constraints exerted on both actuated and unactuated states

T Yang, N Sun, Y Fang - IEEE Transactions on Neural …, 2021 - ieeexplore.ieee.org
Due to limited workspace and safety requirements for practical underactuated mechanical
systems, it is necessary to restrict all to-be-controlled variables and their velocities within …

Event-triggered control of discrete-time zero-sum games via deterministic policy gradient adaptive dynamic programming

Y Zhang, B Zhao, D Liu, S Zhang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In order to address zero-sum game problems for discrete-time (DT) nonlinear systems, this
article develops a novel event-triggered control (ETC) approach based on the deterministic …

Policy Iteration Q-Learning for Data-Based Two-Player Zero-Sum Game of Linear Discrete-Time Systems

B Luo, Y Yang, D Liu - IEEE Transactions on Cybernetics, 2020 - ieeexplore.ieee.org
In this article, the data-based two-player zero-sum game problem is considered for linear
discrete-time systems. This problem theoretically depends on solving the discrete-time game …