Online safety assurance for deep reinforcement learning

NH Rotman, M Schapira, A Tamar - arXiv preprint arXiv:2010.03625, 2020 - arxiv.org
Recently, deep learning has been successfully applied to a variety of networking problems.
A fundamental challenge is that when the operational environment for a learning …

Online safety assurance for learning-augmented systems

NH Rotman, M Schapira, A Tamar - … of the 19th ACM Workshop on Hot …, 2020 - dl.acm.org
Recently, deep learning has been successfully applied to a variety of networking problems.
A fundamental challenge is that when the operational environment for a learning …

Verification-Guided Shielding for Deep Reinforcement Learning

D Corsi, G Amir, A Rodriguez, C Sanchez… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, Deep Reinforcement Learning (DRL) has emerged as an effective approach
to solving real-world tasks. However, despite their successes, DRL-based policies suffer …

Safe deep reinforcement learning by verifying task-level properties

E Marchesini, L Marzari, A Farinelli, C Amato - arXiv preprint arXiv …, 2023 - arxiv.org
Cost functions are commonly employed in Safe Deep Reinforcement Learning (DRL).
However, the cost is typically encoded as an indicator function due to the difficulty of …

BET: Explaining Deep Reinforcement Learning through The Error-Prone Decisions

X Liu, J Zhao, W Chen, M Tan, Y Su - arXiv preprint arXiv:2401.07263, 2024 - arxiv.org
Despite the impressive capabilities of Deep Reinforcement Learning (DRL) agents in many
challenging scenarios, their black-box decision-making process significantly limits their …

Risk sensitive dead-end identification in safety-critical offline reinforcement learning

TW Killian, S Parbhoo, M Ghassemi - arXiv preprint arXiv:2301.05664, 2023 - arxiv.org
In safety-critical decision-making scenarios being able to identify worst-case outcomes, or
dead-ends is crucial in order to develop safe and reliable policies in practice. These …

Safe reinforcement learning using advantage-based intervention

NC Wagener, B Boots… - … Conference on Machine …, 2021 - proceedings.mlr.press
Many sequential decision problems involve finding a policy that maximizes total reward
while obeying safety constraints. Although much recent research has focused on the …

DSMC Evaluation Stages: Fostering Robust and Safe Behavior in Deep Reinforcement Learning–Extended Version

TP Gros, J Groß, D Höller, J Hoffmann… - ACM Transactions on …, 2023 - dl.acm.org
Neural networks (NN) are gaining importance in sequential decision-making. Deep
reinforcement learning (DRL), in particular, is extremely successful in learning action …

Dsmc evaluation stages: Fostering robust and safe behavior in deep reinforcement learning

TP Gros, D Höller, J Hoffmann, M Klauck… - … Evaluation of Systems …, 2021 - Springer
Neural networks (NN) are gaining importance in sequential decision-making. Deep
reinforcement learning (DRL), in particular, is extremely successful in learning action …

Dynamic Model Predictive Shielding for Provably Safe Reinforcement Learning

A Banerjee, K Rahmani, J Biswas, I Dillig - arXiv preprint arXiv …, 2024 - arxiv.org
Among approaches for provably safe reinforcement learning, Model Predictive Shielding
(MPS) has proven effective at complex tasks in continuous, high-dimensional state spaces …