Q-learning for robust satisfaction of signal temporal logic specifications

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

被引用次数：188 相关文章所有 10 个版本

[PDF] diva-portal.org

Control barrier functions for signal temporal logic tasks

L Lindemann, DV Dimarogonas - IEEE control systems letters, 2018 - ieeexplore.ieee.org

The need for computationally-efficient control methods of dynamical systems under temporal
logic tasks has recently become more apparent. Existing methods are computationally …

被引用次数：309 相关文章所有 5 个版本

[PDF] researchgate.net

[图书][B] Formal methods for discrete-time dynamical systems

C Belta, B Yordanov, EA Gol - 2017 - Springer

In control theory, complex models of physical processes, such as systems of differential or
difference equations, are usually checked against simple specifications, such as stability …

被引用次数：463 相关文章所有 8 个版本

[PDF] arxiv.org

Formal synthesis of controllers for safety-critical autonomous systems: Developments and challenges

X Yin, B Gao, X Yu - Annual Reviews in Control, 2024 - Elsevier

In recent years, formal methods have been extensively used in the design of autonomous
systems. By employing mathematically rigorous techniques, formal methods can provide …

被引用次数：4 相关文章所有 5 个版本

[PDF] arxiv.org

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer

Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

被引用次数：67 相关文章所有 3 个版本

[PDF] neurips.cc

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

被引用次数：79 相关文章所有 15 个版本

[PDF] arxiv.org

Reinforcement learning with temporal logic rewards

X Li, CI Vasile, C Belta - 2017 IEEE/RSJ International …, 2017 - ieeexplore.ieee.org

Reinforcement learning (RL) depends critically on the choice of reward functions used to
capture the desired behavior and constraints of a robot. Usually, these are handcrafted by a …

被引用次数：230 相关文章所有 8 个版本

[PDF] arxiv.org

Control synthesis from linear temporal logic specifications using model-free reinforcement learning

AK Bozkurt, Y Wang, MM Zavlanos… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org

We present a reinforcement learning (RL) frame-work to synthesize a control policy from a
given linear temporal logic (LTL) specification in an unknown stochastic environment that …

被引用次数：139 相关文章所有 7 个版本

[PDF] arxiv.org

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

被引用次数：84 相关文章所有 14 个版本

[PDF] mlr.press

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press

We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

被引用次数：70 相关文章所有 9 个版本

高级搜索

QQ 群