Data-driven optimal consensus control for discrete-time multi-agent systems with unknown...

K Zhang, Z Yang, T Başar - Handbook of reinforcement learning and …, 2021 - Springer

Recent years have witnessed significant advances in reinforcement learning (RL), which
has registered tremendous success in solving various sequential decision-making problems …

被引用次数：1404 相关文章所有 8 个版本

Reinforcement learning in robotic applications: a comprehensive survey

B Singh, R Kumar, VP Singh - Artificial Intelligence Review, 2022 - Springer

In recent trends, artificial intelligence (AI) is used for the creation of complex automated
control systems. Still, researchers are trying to make a completely autonomous system that …

被引用次数：188 相关文章所有 4 个版本

[PDF] arxiv.org

A review of cooperative multi-agent deep reinforcement learning

A Oroojlooy, D Hajinezhad - Applied Intelligence, 2023 - Springer

Abstract Deep Reinforcement Learning has made significant progress in multi-agent
systems in recent years. The aim of this review article is to provide an overview of recent …

被引用次数：405 相关文章所有 8 个版本

Distributed fault-tolerant containment control protocols for the discrete-time multiagent systems via reinforcement learning method

T Li, W Bai, Q Liu, Y Long… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

This article investigates the model-free fault-tolerant containment control problem for
multiagent systems (MASs) with time-varying actuator faults. Depending on the relative state …

被引用次数：133 相关文章所有 3 个版本

Adaptive optimized consensus control for a class of nonlinear multi-agent systems with asymmetric input saturation constraints and hybrid faults

F Tang, H Wang, L Zhang, N Xu, AM Ahmad - … in Nonlinear Science and …, 2023 - Elsevier

This article studies the adaptive optimized leader–follower consensus control problem for a
class of discrete-time multi-agent systems with asymmetric input saturation constraints and …

被引用次数：46 相关文章所有 3 个版本

多智能体深度强化学习的若干关键科学问题

孙长银，穆朝絮 - 自动化学报, 2020 - aas.net.cn

强化学习作为一种用于解决无模型序列决策问题的方法已经有数十年的历史,
但强化学习方法在处理高维变量问题时常常会面临巨大挑战. 近年来, 深度学习迅猛发展 …

被引用次数：45 相关文章所有 5 个版本

Data-driven iterative adaptive critic control toward an urban wastewater treatment plant

D Wang, M Ha, J Qiao - IEEE Transactions on Industrial …, 2020 - ieeexplore.ieee.org

The wastewater treatment is an important avenue of resources cyclic utilization when coping
with the modern urban diseases. However, there always exist obvious nonlinearities and …

被引用次数：180 相关文章

Advanced value iteration for discrete-time intelligent critic control: A survey

M Zhao, D Wang, J Qiao, M Ha, J Ren - Artificial Intelligence Review, 2023 - Springer

Optimal control problems are ubiquitous in practical engineering applications and social life
with the idea of cost or resource conservation. Based on the critic learning scheme, adaptive …

被引用次数：31 相关文章所有 2 个版本

[PDF] unit.no

Distributed adaptive leader–follower and leaderless consensus control of a class of strict-feedback nonlinear systems: A unified approach

J Huang, W Wang, C Wen, J Zhou, G Li - Automatica, 2020 - Elsevier

In this paper, distributed adaptive consensus for a class of strict-feedback nonlinear systems
under directed topology condition is investigated. Both leader–follower and leaderless …

被引用次数：139 相关文章所有 3 个版本

[PDF] gatech.edu

Platoon control of connected multi-vehicle systems under V2X communications: Design and experiments

Y Li, W Chen, S Peeta, Y Wang - IEEE Transactions on …, 2019 - ieeexplore.ieee.org

This paper focuses on platoon control of multi-vehicle systems in a realistic vehicle-to-
vehicle/vehicle-to-infrastructure (V2V/V2I, or V2X) communication environment. To this end …

被引用次数：166 相关文章所有 5 个版本

高级搜索

QQ 群