Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

Control barrier functions for signal temporal logic tasks

L Lindemann, DV Dimarogonas - IEEE control systems letters, 2018 - ieeexplore.ieee.org
The need for computationally-efficient control methods of dynamical systems under temporal
logic tasks has recently become more apparent. Existing methods are computationally …

[图书][B] Formal methods for discrete-time dynamical systems

C Belta, B Yordanov, EA Gol - 2017 - Springer
In control theory, complex models of physical processes, such as systems of differential or
difference equations, are usually checked against simple specifications, such as stability …

Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier
In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer
Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

Reinforcement learning with temporal logic rewards

X Li, CI Vasile, C Belta - 2017 IEEE/RSJ International …, 2017 - ieeexplore.ieee.org
Reinforcement learning (RL) depends critically on the choice of reward functions used to
capture the desired behavior and constraints of a robot. Usually, these are handcrafted by a …

Control synthesis from linear temporal logic specifications using model-free reinforcement learning

AK Bozkurt, Y Wang, MM Zavlanos… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
We present a reinforcement learning (RL) frame-work to synthesize a control policy from a
given linear temporal logic (LTL) specification in an unknown stochastic environment that …

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org
This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …