Secure-by-construction synthesis of cyber-physical systems

S Liu, A Trivedi, X Yin, M Zamani - Annual Reviews in Control, 2022 - Elsevier
Correct-by-construction synthesis is a cornerstone of the confluence of formal methods and
control theory towards designing safety-critical systems. Instead of following the time-tested …

Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Learning reward machines for partially observable reinforcement learning

R Toro Icarte, E Waldie, T Klassen… - Advances in neural …, 2019 - proceedings.neurips.cc
Abstract Reward Machines (RMs), originally proposed for specifying problems in
Reinforcement Learning (RL), provide a structured, automata-based representation of a …

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

Symbolic plans as high-level instructions for reinforcement learning

L Illanes, X Yan, RT Icarte, SA McIlraith - Proceedings of the …, 2020 - ojs.aaai.org
Reinforcement learning (RL) agents seek to maximize the cumulative reward obtained when
interacting with their environment. Users define tasks or goals for RL agents by designing …

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

Compositional approach to translate LTLf/LDLf into deterministic finite automata

G De Giacomo, M Favorito - Proceedings of the International …, 2021 - ojs.aaai.org
The translation from temporal logics to automata is the workhorse algorithm of several
techniques in computer science and AI, such as reactive synthesis, reasoning about actions …

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arXiv preprint arXiv:2304.12090, 2023 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …