Compositional policy learning in stochastic control systems with formal guarantees

Đ Žikelić, M Lechner, A Verma… - Advances in …, 2024 - proceedings.neurips.cc
Reinforcement learning has shown promising results in learning neural network policies for
complicated control tasks. However, the lack of formal guarantees about the behavior of …

Parameter Synthesis for Markov Models: Covering the Parameter Space

S Junges, E Ábrahám, C Hensel, N Jansen… - arXiv preprint arXiv …, 2019 - arxiv.org
Markov chain analysis is a key technique in formal verification. A practical obstacle is that all
probabilities in Markov models need to be known. However, system quantities such as …

Neurosymbolic motion and task planning for linear temporal logic tasks

X Sun, Y Shoukry - IEEE Transactions on Robotics, 2024 - ieeexplore.ieee.org
This article presents a neurosymbolic framework to solve motion planning problems for
mobile robots involving temporal goals. The temporal goals are described using temporal …

Compositional Value Iteration with Pareto Caching

K Watanabe, M Vegt, S Junges, I Hasuo - International Conference on …, 2024 - Springer
The de-facto standard approach in MDP verification is based on value iteration (VI). We
propose compositional VI, a framework for model checking compositional MDPs, that …

Ablation study of how run time assurance impacts the training and performance of reinforcement learning agents

N Hamilton, K Dunlap, TT Johnson… - 2023 IEEE 9th …, 2023 - ieeexplore.ieee.org
Reinforcement Learning (RL) has become an increasingly important research area as the
success of machine learning algorithms and methods grows. To combat the safety concerns …

Pareto Curves for Compositionally Model Checking String Diagrams of MDPs

K Watanabe, M van der Vegt, I Hasuo, J Rot… - … Conference on Tools …, 2024 - Springer
Computing schedulers that optimize reachability probabilities in MDPs is a standard
verification task. To address scalability concerns, we focus on MDPs that are compositionally …

Verified compositions of neural network controllers for temporal logic control objectives

J Wang, S Kalluraya, Y Kantaros - 2022 IEEE 61st Conference …, 2022 - ieeexplore.ieee.org
This paper presents a new approach to design verified compositions of Neural Network (NN)
controllers for autonomous systems with tasks captured by Linear Temporal Logic (LTL) …

Parameter synthesis for Markov models: covering the parameter space

S Junges, E Ábrahám, C Hensel, N Jansen… - Formal Methods in …, 2024 - Springer
Markov chain analysis is a key technique in formal verification. A practical obstacle is that all
probabilities in Markov models need to be known. However, system quantities such as …

A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning

C Neary, C Ellis, AS Samyal… - … on Robotics and …, 2024 - ieeexplore.ieee.org
We propose and demonstrate a compositional framework for training and verifying
reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to …

Compositional reinforcement learning for discrete-time stochastic control systems

A Lavaei, M Perez, M Kazemi… - IEEE Open Journal …, 2023 - ieeexplore.ieee.org
We propose a compositional approach to synthesize policies for networks of continuous-
space stochastic control systems with unknown dynamics using model-free reinforcement …