Deterministic policy gradient adaptive dynamic programming for model-free optimal control

D Liu, S Xue, B Zhao, B Luo… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

This article reviews the recent development of adaptive dynamic programming (ADP) with
applications in control. First, its applications in optimal regulation are introduced, and some …

被引用次数：490 相关文章所有 2 个版本

Dual event-triggered constrained control through adaptive critic for discrete-time zero-sum games

D Wang, L Hu, M Zhao, J Qiao - IEEE Transactions on Systems …, 2022 - ieeexplore.ieee.org

In this article, through adaptive critic, a dual event-triggered (DET) constrained control
scheme is established for discrete-time nonlinear zero-sum games. The neural networks are …

被引用次数：83 相关文章

[PDF] mlr.press

Deterministic policy gradient algorithms

D Silver, G Lever, N Heess, T Degris… - International …, 2014 - proceedings.mlr.press

In this paper we consider deterministic policy gradient algorithms for reinforcement learning
with continuous actions. The deterministic policy gradient has a particularly appealing form …

被引用次数：5563 相关文章所有 32 个版本

Event-triggered control of discrete-time zero-sum games via deterministic policy gradient adaptive dynamic programming

Y Zhang, B Zhao, D Liu, S Zhang - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

In order to address zero-sum game problems for discrete-time (DT) nonlinear systems, this
article develops a novel event-triggered control (ETC) approach based on the deterministic …

被引用次数：73 相关文章

[PDF] springer.com

Collision-free path planning for welding manipulator via hybrid algorithm of deep reinforcement learning and inverse kinematics

J Zhong, T Wang, L Cheng - Complex & Intelligent Systems, 2021 - Springer

In actual welding scenarios, an effective path planner is needed to find a collision-free path
in the configuration space for the welding manipulator with obstacles around. However, as a …

被引用次数：69 相关文章所有 5 个版本

Barrier Lyapunov function-based safe reinforcement learning for autonomous vehicles with optimized backstepping

Y Zhang, X Liang, D Li, SS Ge, B Gao… - … on Neural Networks …, 2022 - ieeexplore.ieee.org

Guaranteed safety and performance under various circumstances remain technically critical
and practically challenging for the wide deployment of autonomous vehicles. Safety-critical …

被引用次数：43 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Universal workflow of artificial intelligence for energy saving

D Lee, YT Chen, SL Chao - Energy Reports, 2022 - Elsevier

Artificial intelligence (AI) controls are commonly used to save energy. However, excessive
diversity in technological development has resulted in the inability to provide consistent …

被引用次数：39 相关文章所有 3 个版本

Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function

J Xu, J Wang, J Rao, Y Zhong… - International Journal of …, 2022 - Wiley Online Library

Adaptive dynamic programming (ADP) methods have demonstrated their efficiency.
However, many of the applications for which ADP offers great potential, are also safety …

被引用次数：43 相关文章所有 2 个版本

[PDF] researchgate.net

Goal representation adaptive critic design for discrete-time uncertain systems subjected to input constraints: The event-triggered case

S Zhao, J Wang, H Wang, H Xu - Neurocomputing, 2022 - Elsevier

In this article, the event-triggered near-optimal control issue is studied for the input-
constrained uncertain system with the input-to-state stability (ISS) attribute. In the proposed …

被引用次数：21 相关文章所有 3 个版本

[HTML] sciencedirect.com

[HTML][HTML] Actor–critic reinforcement learning and application in developing computer-vision-based interface tracking

O Dogru, K Velswamy, B Huang - Engineering, 2021 - Elsevier

This paper synchronizes control theory with computer vision by formalizing object tracking
as a sequential decision-making process. A reinforcement learning (RL) agent successfully …

被引用次数：26 相关文章所有 5 个版本

高级搜索

QQ 群