Not only rewards but also constraints: Applications on legged robot locomotion

Y Kim, H Oh, J Lee, J Choi, G Ji, M Jung… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Several earlier studies have shown impressive control performance in complex robotic
systems by designing the controller using a neural network and training it with model-free …

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Y Yao, Z Liu, Z Cen, J Zhu, W Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc
Safe reinforcement learning (RL) focuses on training reward-maximizing agents subject to
pre-defined safety constraints. Yet, learning versatile safe policies that can adapt to varying …

Probabilistic constraint for safety-critical reinforcement learning

W Chen, D Subramanian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In this paper, we consider the problem of learning safe policies for probabilistic-constrained
reinforcement learning (RL). Specifically, a safe policy or controller is one that, with high …

A Survey of Constraint Formulations in Safe Reinforcement Learning

A Wachi, X Shen, Y Sui - arXiv preprint arXiv:2402.02025, 2024 - arxiv.org
Ensuring safety is critical when applying reinforcement learning (RL) to real-world problems.
Consequently, safe RL emerges as a fundamental and powerful paradigm for safely …

Cross-domain policy adaptation via value-guided data filtering

K Xu, C Bai, X Ma, D Wang, B Zhao… - Advances in …, 2024 - proceedings.neurips.cc
Generalizing policies across different domains with dynamics mismatch poses a significant
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …

Trust region-based safe distributional reinforcement learning for multiple constraints

D Kim, K Lee, S Oh - Advances in neural information …, 2024 - proceedings.neurips.cc
In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints
must be met, such as avoiding collisions, limiting energy consumption, and maintaining …

Risk-averse model uncertainty for distributionally robust safe reinforcement learning

J Queeney, M Benosman - Advances in Neural Information …, 2024 - proceedings.neurips.cc
Many real-world domains require safe decision making in uncertain environments. In this
work, we introduce a deep reinforcement learning framework for approaching this important …

CVaR-Constrained Policy Optimization for Safe Reinforcement Learning

Q Zhang, S Leng, X Ma, Q Liu, X Wang… - … on Neural Networks …, 2024 - ieeexplore.ieee.org
Current constrained reinforcement learning (RL) methods guarantee constraint satisfaction
only in expectation, which is inadequate for safety-critical decision problems. Since a …

Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Y Yang, L Chen, Z Zaidi, S van Waveren… - Proceedings of the …, 2024 - dl.acm.org
Learning from Demonstration (LfD) is a powerful method for non-roboticists end-users to
teach robots new tasks, enabling them to customize the robot behavior. However, modern …

Shielded Planning Guided Data-Efficient and Safe Reinforcement Learning

H Wang, J Qin, Z Kan - IEEE Transactions on Neural Networks …, 2024 - ieeexplore.ieee.org
Safe reinforcement learning (RL) has shown great potential for building safe general-
purpose robotic systems. While many existing works have focused on post-training policy …