Stochastic safety for Markov chains

R Wisniewski, ML Bujorianu - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org

This article aims to incorporate safety specifications into Markov decision processes.
Explicitly, we address the minimization problem up to a stopping time with safety constraints …

Cited by 7 - Related articles - All 2 versions

[PDF] arxiv.org

Safe reinforcement learning for constrained Markov decision processes with stochastic stopping time

A Mazumdar, R Wisniewski, ML Bujorianu - arXiv preprint arXiv …, 2024 - arxiv.org

In this paper, we present an online reinforcement learning algorithm for constrained Markov
decision processes with a safety constraint. Despite the necessary attention of the scientific …

Cited by 1 - Related articles - All 3 versions

[PDF] ucl.ac.uk

Online learning of safety function for Markov decision processes

A Mazumdar, R Wisniewski… - 2023 European Control …, 2023 - ieeexplore.ieee.org

In this paper, we aim to study safety specifications for a Markov decision process with
stochastic stopping time in an almost model-free setting. Our approach involves …

Cited by 2 - Related articles - All 4 versions

[PDF] strath.ac.uk

Stochastic safety for random dynamical systems

ML Bujorianu, R Wisniewski… - 2021 American Control …, 2021 - ieeexplore.ieee.org

In the paper, we study the so-called p-safety of a random dynamical system. We generalize
the existing results for safety barrier certificates for deterministic dynamical systems and …

Cited by 4 - Related articles - All 6 versions

Stochastic Safety of Hybrid Markov Chains

ML Bujorianu, R Wisniewski, A Mazumdar - IFAC-PapersOnLine, 2024 - Elsevier

In this paper we study the stochastic safety problem for a class of Markov chains generated
by iterated function systems (IFS). These Markov chains represent a mathematical construct …

[PDF] arxiv.org

On Bellman's principle of optimality and Reinforcement learning for safety-constrained Markov decision process

R Misra, R Wisniewski, CS Kallesøe - arXiv preprint arXiv:2302.13152, 2023 - arxiv.org

We study optimality for the safety-constrained Markov decision process which is the
underlying framework for safe reinforcement learning. Specifically, we consider a …

Safe Dynamic Programming

R Wisniewski, ML Bujorianu - arXiv preprint arXiv:2109.03307, 2021 - arxiv.org

We incorporate safety specifications into dynamic programming. Explicitly, we address the
minimization problem of a Markov decision process up to a stopping time with safety …

Cited by 1 - Related articles - All 2 versions

[PDF] ucl.ac.uk

From MDP to POMDP and Back: Safety and Compositionality

ML Bujorianu, T Caulfield, D Pym… - 2023 European …, 2023 - ieeexplore.ieee.org

We propose a compositional framework for the stochastic safety of distributed Markov
Decision Processes (MDPs) and Partially Observable Markov Decision Processes …

Cyber-Physical Ecosystems: Modelling and Verification

ML Bujorianu - International Conference on Engineering of Computer …, 2023 - Springer

In this paper, we set up a mathematical framework for the modelling and verification of
complex cyber-physical ecosystems. In our setting, cyber-physical ecosystems are cyber …
