Verifiable and compositional reinforcement learning systems

Đ Žikelić, M Lechner, A Verma… - Advances in …, 2024 - proceedings.neurips.cc

Reinforcement learning has shown promising results in learning neural network policies for
complicated control tasks. However, the lack of formal guarantees about the behavior of …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

Parameter Synthesis for Markov Models: Covering the Parameter Space

S Junges, E Ábrahám, C Hensel, N Jansen… - arXiv preprint arXiv …, 2019 - arxiv.org

Markov chain analysis is a key technique in formal verification. A practical obstacle is that all
probabilities in Markov models need to be known. However, system quantities such as …

被引用次数：54 相关文章所有 5 个版本

[PDF] arxiv.org

Neurosymbolic motion and task planning for linear temporal logic tasks

X Sun, Y Shoukry - IEEE Transactions on Robotics, 2024 - ieeexplore.ieee.org

This article presents a neurosymbolic framework to solve motion planning problems for
mobile robots involving temporal goals. The temporal goals are described using temporal …

被引用次数：8 相关文章所有 4 个版本

[PDF] springer.com

Compositional Value Iteration with Pareto Caching

K Watanabe, M Vegt, S Junges, I Hasuo - International Conference on …, 2024 - Springer

The de-facto standard approach in MDP verification is based on value iteration (VI). We
propose compositional VI, a framework for model checking compositional MDPs, that …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Ablation study of how run time assurance impacts the training and performance of reinforcement learning agents

N Hamilton, K Dunlap, TT Johnson… - 2023 IEEE 9th …, 2023 - ieeexplore.ieee.org

Reinforcement Learning (RL) has become an increasingly important research area as the
success of machine learning algorithms and methods grows. To combat the safety concerns …

被引用次数：10 相关文章所有 5 个版本

[PDF] springer.com

Pareto Curves for Compositionally Model Checking String Diagrams of MDPs

K Watanabe, M van der Vegt, I Hasuo, J Rot… - … Conference on Tools …, 2024 - Springer

Computing schedulers that optimize reachability probabilities in MDPs is a standard
verification task. To address scalability concerns, we focus on MDPs that are compositionally …

被引用次数：5 相关文章所有 6 个版本

[PDF] arxiv.org

Verified compositions of neural network controllers for temporal logic control objectives

J Wang, S Kalluraya, Y Kantaros - 2022 IEEE 61st Conference …, 2022 - ieeexplore.ieee.org

This paper presents a new approach to design verified compositions of Neural Network (NN)
controllers for autonomous systems with tasks captured by Linear Temporal Logic (LTL) …

被引用次数：6 相关文章所有 4 个版本

[PDF] springer.com

Parameter synthesis for Markov models: covering the parameter space

S Junges, E Ábrahám, C Hensel, N Jansen… - Formal Methods in …, 2024 - Springer

Markov chain analysis is a key technique in formal verification. A practical obstacle is that all
probabilities in Markov models need to be known. However, system quantities such as …

被引用次数：2 相关文章所有 6 个版本

A Multifidelity Sim-to-Real Pipeline for Verifiable and Compositional Reinforcement Learning

C Neary, C Ellis, AS Samyal… - … on Robotics and …, 2024 - ieeexplore.ieee.org

We propose and demonstrate a compositional framework for training and verifying
reinforcement learning (RL) systems within a multifidelity sim-to-real pipeline, in order to …

被引用次数：1 相关文章所有 2 个版本

[PDF] ieee.org

Compositional reinforcement learning for discrete-time stochastic control systems

A Lavaei, M Perez, M Kazemi… - IEEE Open Journal …, 2023 - ieeexplore.ieee.org

We propose a compositional approach to synthesize policies for networks of continuous-
space stochastic control systems with unknown dynamics using model-free reinforcement …

被引用次数：5 相关文章所有 4 个版本

高级搜索

QQ 群