Omnisafe: An infrastructure for accelerating safe reinforcement learning research

J Ji, J Zhou, B Zhang, J Dai, X Pan, R Sun… - arXiv preprint arXiv …, 2023 - arxiv.org
AI systems empowered by reinforcement learning (RL) algorithms harbor the immense
potential to catalyze societal advancement, yet their deployment is often impeded by …

Safety gymnasium: A unified safe reinforcement learning benchmark

J Ji, B Zhang, J Zhou, X Pan… - Advances in …, 2023 - proceedings.neurips.cc
Artificial intelligence (AI) systems possess significant potential to drive societal progress.
However, their deployment often faces obstacles due to substantial safety concerns. Safe …

A review of safe reinforcement learning: Methods, theory and applications

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Reinforcement learning (RL) has achieved tremendous success in many complex decision
making tasks. When it comes to deploying RL in the real world, safety concerns are usually …

Guard: A safe reinforcement learning benchmark

W Zhao, R Chen, Y Sun, R Liu, T Wei, C Liu - arXiv preprint arXiv …, 2023 - arxiv.org
Due to the trial-and-error nature, it is typically challenging to apply RL algorithms to safety-
critical real-world applications, such as autonomous driving, human-robot interaction, robot …

Towards safe reinforcement learning with a safety editor policy

H Yu, W Xu, H Zhang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We consider the safe reinforcement learning (RL) problem of maximizing utility with
extremely low constraint violation rates. Assuming no prior knowledge or pre-training of the …

Evaluating model-free reinforcement learning toward safety-critical tasks

L Zhang, Q Zhang, L Shen, B Yuan, X Wang… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Safety comes first in many real-world applications involving autonomous agents. Despite a
large number of reinforcement learning (RL) methods focusing on safety-critical tasks, there …

Conservative and adaptive penalty for model-based safe reinforcement learning

YJ Ma, A Shen, O Bastani, J Dinesh - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Reinforcement Learning (RL) agents in the real world must satisfy safety constraints in
addition to maximizing a reward objective. Model-based RL algorithms hold promise for …

Safe dreamerv3: Safe reinforcement learning with world models

W Huang, J Ji, B Zhang, C Xia, Y Yang - arXiv preprint arXiv:2307.07176, 2023 - arxiv.org
The widespread application of Reinforcement Learning (RL) in real-world situations is yet to
come to fruition, largely as a result of its failure to satisfy the essential safety demands of …

Datasets and benchmarks for offline safe reinforcement learning

Z Liu, Z Guo, H Lin, Y Yao, J Zhu, Z Cen, H Hu… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper presents a comprehensive benchmarking suite tailored to offline safe
reinforcement learning (RL) challenges, aiming to foster progress in the development and …

Feasible actor-critic: Constrained reinforcement learning for ensuring statewise safety

H Ma, Y Guan, SE Li, X Zhang, S Zheng… - arXiv preprint arXiv …, 2021 - arxiv.org
The safety constraints commonly used by existing safe reinforcement learning (RL) methods
are defined only on expectation of initial states, but allow each certain state to be unsafe …