Conservative safety critics for exploration

Y Kim, H Oh, J Lee, J Choi, G Ji, M Jung… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

Several earlier studies have shown impressive control performance in complex robotic
systems by designing the controller using a neural network and training it with model-free …

被引用次数：13 相关文章所有 3 个版本

[PDF] neurips.cc

Constraint-conditioned policy optimization for versatile safe reinforcement learning

Y Yao, Z Liu, Z Cen, J Zhu, W Yu… - Advances in Neural …, 2024 - proceedings.neurips.cc

Safe reinforcement learning (RL) focuses on training reward-maximizing agents subject to
pre-defined safety constraints. Yet, learning versatile safe policies that can adapt to varying …

被引用次数：5 相关文章所有 8 个版本

[PDF] arxiv.org

Probabilistic constraint for safety-critical reinforcement learning

W Chen, D Subramanian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In this paper, we consider the problem of learning safe policies for probabilistic-constrained
reinforcement learning (RL). Specifically, a safe policy or controller is one that, with high …

被引用次数：10 相关文章所有 4 个版本

[PDF] arxiv.org

A Survey of Constraint Formulations in Safe Reinforcement Learning

A Wachi, X Shen, Y Sui - arXiv preprint arXiv:2402.02025, 2024 - arxiv.org

Ensuring safety is critical when applying reinforcement learning (RL) to real-world problems.
Consequently, safe RL emerges as a fundamental and powerful paradigm for safely …

被引用次数：1 相关文章所有 2 个版本

[PDF] neurips.cc

Cross-domain policy adaptation via value-guided data filtering

K Xu, C Bai, X Ma, D Wang, B Zhao… - Advances in …, 2024 - proceedings.neurips.cc

Generalizing policies across different domains with dynamics mismatch poses a significant
challenge in reinforcement learning. For example, a robot learns the policy in a simulator …

被引用次数：6 相关文章所有 5 个版本

[PDF] neurips.cc

Trust region-based safe distributional reinforcement learning for multiple constraints

D Kim, K Lee, S Oh - Advances in neural information …, 2024 - proceedings.neurips.cc

In safety-critical robotic tasks, potential failures must be reduced, and multiple constraints
must be met, such as avoiding collisions, limiting energy consumption, and maintaining …

被引用次数：8 相关文章所有 6 个版本

[PDF] neurips.cc

Risk-averse model uncertainty for distributionally robust safe reinforcement learning

J Queeney, M Benosman - Advances in Neural Information …, 2024 - proceedings.neurips.cc

Many real-world domains require safe decision making in uncertain environments. In this
work, we introduce a deep reinforcement learning framework for approaching this important …

被引用次数：7 相关文章所有 8 个版本

CVaR-Constrained Policy Optimization for Safe Reinforcement Learning

Q Zhang, S Leng, X Ma, Q Liu, X Wang… - … on Neural Networks …, 2024 - ieeexplore.ieee.org

Current constrained reinforcement learning (RL) methods guarantee constraint satisfaction
only in expectation, which is inadequate for safety-critical decision problems. Since a …

被引用次数：2 相关文章所有 3 个版本

[PDF] acm.org

Enhancing Safety in Learning from Demonstration Algorithms via Control Barrier Function Shielding

Y Yang, L Chen, Z Zaidi, S van Waveren… - Proceedings of the …, 2024 - dl.acm.org

Learning from Demonstration (LfD) is a powerful method for non-roboticists end-users to
teach robots new tasks, enabling them to customize the robot behavior. However, modern …

被引用次数：1 相关文章所有 2 个版本

Shielded Planning Guided Data-Efficient and Safe Reinforcement Learning

H Wang, J Qin, Z Kan - IEEE Transactions on Neural Networks …, 2024 - ieeexplore.ieee.org

Safe reinforcement learning (RL) has shown great potential for building safe general-
purpose robotic systems. While many existing works have focused on post-training policy …

被引用次数：1 相关文章所有 3 个版本

高级搜索

QQ 群