相关文章- 学术资源搜索

Online safety assurance for deep reinforcement learning

NH Rotman, M Schapira, A Tamar - arXiv preprint arXiv:2010.03625, 2020 - arxiv.org

Recently, deep learning has been successfully applied to a variety of networking problems.
A fundamental challenge is that when the operational environment for a learning …

被引用次数：8 相关文章所有 2 个版本

Online safety assurance for learning-augmented systems

NH Rotman, M Schapira, A Tamar - … of the 19th ACM Workshop on Hot …, 2020 - dl.acm.org

Recently, deep learning has been successfully applied to a variety of networking problems.
A fundamental challenge is that when the operational environment for a learning …

被引用次数：17 相关文章

[PDF] arxiv.org

Verification-Guided Shielding for Deep Reinforcement Learning

D Corsi, G Amir, A Rodriguez, C Sanchez… - arXiv preprint arXiv …, 2024 - arxiv.org

In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach
to solving real-world tasks. However, despite their successes, DRL-based policies suffer …

Safe deep reinforcement learning by verifying task-level properties

E Marchesini, L Marzari, A Farinelli, C Amato - arXiv preprint arXiv …, 2023 - arxiv.org

Cost functions are commonly employed in Safe Deep Reinforcement Learning (DRL).
However, the cost is typically encoded as an indicator function due to the difficulty of …

被引用次数：9 相关文章所有 5 个版本

[PDF] arxiv.org

BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions

X Liu, J Zhao, W Chen, M Tan, Y Su - arXiv preprint arXiv:2401.07263, 2024 - arxiv.org

Despite the impressive capabilities of Deep Reinforcement Learning (DRL) agents in many
challenging scenarios, their black-box decision-making process significantly limits their …

Risk sensitive dead-end identification in safety-critical offline reinforcement learning

TW Killian, S Parbhoo, M Ghassemi - arXiv preprint arXiv:2301.05664, 2023 - arxiv.org

In safety-critical decision-making scenarios being able to identify worst-case outcomes, or
dead-ends is crucial in order to develop safe and reliable policies in practice. These …

被引用次数：8 相关文章所有 4 个版本

[PDF] mlr.press

Safe reinforcement learning using advantage-based intervention

NC Wagener, B Boots… - … Conference on Machine …, 2021 - proceedings.mlr.press

Many sequential decision problems involve finding a policy that maximizes total reward
while obeying safety constraints. Although much recent research has focused on the …

被引用次数：49 相关文章所有 6 个版本

[PDF] acm.org

DSMC Evaluation Stages: Fostering Robust and Safe Behavior in Deep Reinforcement Learning–Extended Version

TP Gros, J Groß, D Höller, J Hoffmann… - ACM Transactions on …, 2023 - dl.acm.org

Neural networks (NN) are gaining importance in sequential decision-making. Deep
reinforcement learning (DRL), in particular, is extremely successful in learning action …

被引用次数：3 相关文章

Dsmc evaluation stages: Fostering robust and safe behavior in deep reinforcement learning

TP Gros, D Höller, J Hoffmann, M Klauck… - … Evaluation of Systems …, 2021 - Springer

Neural networks (NN) are gaining importance in sequential decision-making. Deep
reinforcement learning (DRL), in particular, is extremely successful in learning action …

被引用次数：9 相关文章所有 4 个版本

[PDF] arxiv.org

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

A Banerjee, K Rahmani, J Biswas, I Dillig - arXiv preprint arXiv …, 2024 - arxiv.org

Among approaches for provably safe reinforcement learning, Model Predictive Shielding
(MPS) has proven effective at complex tasks in continuous, high-dimensional state spaces …

高级搜索

QQ 群

Online safety assurance for deep reinforcement learning

Online safety assurance for learning-augmented systems

Verification-Guided Shielding for Deep Reinforcement Learning

Safe deep reinforcement learning by verifying task-level properties

BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions

Risk sensitive dead-end identification in safety-critical offline reinforcement learning

Safe reinforcement learning using advantage-based intervention

DSMC Evaluation Stages: Fostering Robust and Safe Behavior in Deep Reinforcement Learning–Extended Version

Dsmc evaluation stages: Fostering robust and safe behavior in deep reinforcement learning

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

相关搜索

引用