Reinforcement learning for temporal logic control synthesis with probabilistic satisfaction...

I Momennejad, H Hasanbeig… - Advances in …, 2024 - proceedings.neurips.cc

Recently an influx of studies claims emergent cognitive abilities in large language models
(LLMs). Yet, most rely on anecdotes, overlook contamination of training sets, or lack …

被引用次数：49 相关文章所有 6 个版本

[PDF] jair.org Full View

Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

被引用次数：255 相关文章所有 10 个版本

[PDF] arxiv.org

Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier

In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

被引用次数：9 相关文章所有 5 个版本

[PDF] neurips.cc

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

被引用次数：98 相关文章所有 15 个版本

[PDF] dagstuhl.de

Safe reinforcement learning using probabilistic shields

N Jansen, B Könighofer, S Junges… - 31st International …, 2020 - drops.dagstuhl.de

This paper concerns the efficient construction of a safety shield for reinforcement learning.
We specifically target scenarios that incorporate uncertainty and use Markov decision …

被引用次数：123 相关文章所有 8 个版本

[PDF] hal.science

Control for smart systems: Challenges and trends in smart cities

QS Jia, H Panetto, M Macchi, S Siri, G Weichhart… - Annual Reviews in …, 2022 - Elsevier

There have been tremendous developments in theories and technologies in control for
smart systems. In this paper we review applications to various systems that are crucial for the …

被引用次数：25 相关文章所有 4 个版本

[PDF] arxiv.org

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

被引用次数：93 相关文章所有 14 个版本

[PDF] arxiv.org

Cautious reinforcement learning with logical constraints

M Hasanbeig, A Abate, D Kroening - arXiv preprint arXiv:2002.12156, 2020 - arxiv.org

This paper presents the concept of an adaptive safe padding that forces Reinforcement
Learning (RL) to synthesise optimal control policies while ensuring safety during the …

被引用次数：108 相关文章所有 8 个版本

[PDF] ox.ac.uk

Deep reinforcement learning with temporal logics

M Hasanbeig, D Kroening, A Abate - … and Analysis of Timed Systems: 18th …, 2020 - Springer

The combination of data-driven learning methods with formal reasoning has seen a surge of
interest, as either area has the potential to bolstering the other. For instance, formal methods …

被引用次数：75 相关文章所有 5 个版本

[PDF] neurips.cc

Compositional policy learning in stochastic control systems with formal guarantees

Đ Žikelić, M Lechner, A Verma… - Advances in …, 2024 - proceedings.neurips.cc

Reinforcement learning has shown promising results in learning neural network policies for
complicated control tasks. However, the lack of formal guarantees about the behavior of …

被引用次数：10 相关文章所有 5 个版本

高级搜索

QQ 群