Recent progress in reinforcement learning and adaptive dynamic programming for advanced control applications

D Wang, N Gao, D Liu, J Li… - IEEE/CAA Journal of …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) has roots in dynamic programming and it is called
adaptive/approximate dynamic programming (ADP) within the control community. This paper …

Adaptive practical optimal time-varying formation tracking control for disturbed high-order multi-agent systems

J Yu, X Dong, Q Li, J Lü, Z Ren - IEEE Transactions on Circuits …, 2022 - ieeexplore.ieee.org
The adaptive practical optimal time-varying formation tracking problems of the disturbed
high-order multi-agent systems with a noncooperative leader are considered. Different from …

A metaverse-based teaching building evacuation training system with deep reinforcement learning

J Gu, J Wang, X Guo, G Liu, S Qin… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
With the development of IoT, virtual reality, cloud computing, and digital twin technologies,
the advent of metaverse has attracted increasing world attention. Metaverse integrates and …

A multistage game in smart grid security: A reinforcement learning solution

Z Ni, S Paul - IEEE transactions on neural networks and …, 2019 - ieeexplore.ieee.org
Existing smart grid security research investigates different attack techniques and cascading
failures from the attackers' viewpoints, while the defenders' or the operators' protection …

Event-triggered-based adaptive dynamic programming for distributed formation control of multi-UAV

L Dou, S Cai, X Zhang, X Su, R Zhang - Journal of the Franklin Institute, 2022 - Elsevier
This paper is concerned with the distributed formation control problem of multi-quadrotor
unmanned aerial vehicle (UAV) in the framework of event triggering. First, for the position …

Optimal consensus control design for multiagent systems with multiple time delay using adaptive dynamic programming

H Zhang, H Ren, Y Mu, J Han - IEEE transactions on …, 2021 - ieeexplore.ieee.org
In this article, a novel data-based adaptive dynamic programming (ADP) method is
presented to solve the optimal consensus tracking control problem for discrete-time (DT) …

Optimal tracking control of nonlinear multiagent systems using internal reinforce Q-learning

Z Peng, R Luo, J Hu, K Shi, SK Nguang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this article, a novel reinforcement learning (RL) method is developed to solve the optimal
tracking control problem of unknown nonlinear multiagent systems (MASs). Different from …

Learning automata-based multiagent reinforcement learning for optimization of cooperative tasks

Z Zhang, D Wang, J Gao - IEEE transactions on neural …, 2020 - ieeexplore.ieee.org
Multiagent reinforcement learning (MARL) has been extensively used in many applications
for its tractable implementation and task distribution. Learning automata, which can be …

Event-triggered communication network with limited-bandwidth constraint for multi-agent reinforcement learning

G Hu, Y Zhu, D Zhao, M Zhao… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Communicating agents with each other in a distributed manner and behaving as a group are
essential in multi-agent reinforcement learning. However, real-world multi-agent systems …

Predictor-based extended-state-observer design for consensus of MASs with delays and disturbances

C Wang, Z Zuo, Z Qi, Z Ding - IEEE transactions on cybernetics, 2018 - ieeexplore.ieee.org
In this paper, we study output feedback leader-follower consensus problem for multiagent
systems subject to external disturbances and time delays in both input and output. First, we …