A Review of Safe Reinforcement Learning: Methods, Theories and Applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org
Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Learning safe control for multi-robot systems: Methods, verification, and open challenges

K Garg, S Zhang, O So, C Dawson, C Fan - Annual Reviews in Control, 2024 - Elsevier
In this survey, we review the recent advances in control design methods for robotic multi-
agent systems (MAS), focusing on learning-based methods with safety considerations. We …

POCE: Primal Policy Optimization with Conservative Estimation for Multi-constraint Offline Reinforcement Learning

J Guan, L Shen, A Zhou, L Li, H Hu… - Proceedings of the …, 2024 - openaccess.thecvf.com
Multi-constraint offline reinforcement learning (RL) promises to learn policies that satisfy
both cumulative and state-wise costs from offline datasets. This arrangement provides an …

A finite-sample analysis of payoff-based independent learning in zero-sum stochastic games

Z Chen, K Zhang, E Mazumdar… - Advances in …, 2024 - proceedings.neurips.cc
In this work, we study two-player zero-sum stochastic games and develop a variant of the
smoothed best-response learning dynamics that combines independent learning dynamics …

Safe reinforcement learning for power system control: A review

P Yu, Z Wang, H Zhang, Y Song - arXiv preprint arXiv:2407.00681, 2024 - arxiv.org
The large-scale integration of intermittent renewable energy resources introduces increased
uncertainty and volatility to the supply side of power systems, thereby complicating system …

A human-centered safe robot reinforcement learning framework with interactive behaviors

S Gu, A Kshirsagar, Y Du, G Chen, J Peters… - Frontiers in …, 2023 - frontiersin.org
Deployment of Reinforcement Learning (RL) algorithms for robotics applications in the real
world requires ensuring the safety of the robot and its environment. Safe Robot RL (SRRL) is …

Multi-agent deep reinforcement learning for dynamic reconfigurable shop scheduling considering batch processing and worker cooperation

Y Li, X Li, L Gao, Z Lu - Robotics and Computer-Integrated Manufacturing, 2025 - Elsevier
Reconfigurable manufacturing system is considered as a promising next-generation
manufacturing paradigm. However, limited equipment and complex product processes add …

Dynamic routing for integrated satellite-terrestrial networks: A constrained multi-agent reinforcement learning approach

Y Lyu, H Hu, R Fan, Z Liu, J An… - IEEE Journal on Selected …, 2024 - ieeexplore.ieee.org
The integrated satellite-terrestrial network (ISTN) system has experienced significant growth,
offering seamless communication services in remote areas with limited terrestrial …

Safe Multiagent Learning With Soft Constrained Policy Optimization in Real Robot Control

S Gu, D Huang, M Wen, G Chen… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Due to a lack of safety considerations, a wide range of multiagent reinforcement learning
(MARL) applications are limited in real-world environments. Thus, ensuring MARL safety is …

Multi-agent reinforcement learning for autonomous driving: A survey

R Zhang, J Hou, F Walter, S Gu, J Guan… - arXiv preprint arXiv …, 2024 - arxiv.org
Reinforcement Learning (RL) is a potent tool for sequential decision-making and has
achieved performance surpassing human capabilities across many challenging real-world …