State augmented constrained reinforcement learning: Overcoming the limitations of learning...

S Gu, L Yang, Y Du, G Chen, F Walter, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

Cited by 256

Last-iterate convergent policy gradient primal-dual methods for constrained MDPs

D Ding, CY Wei, K Zhang… - Advances in Neural …, 2024 - proceedings.neurips.cc

We study the problem of computing an optimal policy of an infinite-horizon discounted
constrained Markov decision process (constrained MDP). Despite the popularity of …

Cited by 22

Probabilistic constraint for safety-critical reinforcement learning

W Chen, D Subramanian… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

In this paper, we consider the problem of learning safe policies for probabilistic-constrained
reinforcement learning (RL). Specifically, a safe policy or controller is one that, with high …

Cited by 13

Safe Pontryagin differentiable programming

W Jin, S Mou, GJ Pappas - Advances in Neural Information …, 2021 - proceedings.neurips.cc

We propose a Safe Pontryagin Differentiable Programming (Safe PDP)
methodology, which establishes a theoretical and algorithmic framework to solve a broad …

Cited by 38

A Review of Safe Reinforcement Learning: Methods, Theories and Applications

S Gu, L Yang, Y Du, G Chen, F Walter… - … on Pattern Analysis …, 2024 - ieeexplore.ieee.org

Reinforcement Learning (RL) has achieved tremendous success in many complex decision-
making tasks. However, safety concerns are raised during deploying RL in real-world …

State-augmented learnable algorithms for resource management in wireless networks

N NaderiAlizadeh, M Eisen… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org

We consider resource management problems in multi-user wireless networks, which can be
cast as optimizing a network-wide utility function, subject to constraints on the long-term …

Cited by 13

Resilient Constrained Reinforcement Learning

D Ding, Z Huan, A Ribeiro - International Conference on …, 2024 - proceedings.mlr.press

We study a class of constrained reinforcement learning (RL) problems in which multiple
constraint specifications are not identified before training. It is challenging to identify …

Cited by 2

Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs

S Rozada, D Ding, AG Marques, A Ribeiro - arXiv preprint arXiv …, 2024 - arxiv.org

We study the problem of computing deterministic optimal policies for constrained Markov
decision processes (MDPs) with continuous state and action spaces, which are widely …

Cited by 1

Towards Cooperative Driving among Heterogeneous CAVs: A Safe Multi-Agent Reinforcement Learning Approach

Y Pan, J Lei, P Yi, L Guo, H Chen - IEEE Transactions on …, 2024 - ieeexplore.ieee.org

With the advancement of Intelligent Transportation Systems and Vehicle-to-Everything
communication technologies, the future traffic scenario is anticipated to be a mixed …

SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization

J Mhamed, S Gu - arXiv preprint arXiv:2311.00880, 2023 - arxiv.org

Incorporating safety is an essential prerequisite for broadening the practical applications of
reinforcement learning in real-world scenarios. To tackle this challenge, Constrained Markov …
