LTL and Beyond: Formal Languages for Reward Function Specification in Reinforcement Learning.

S Liu, A Trivedi, X Yin, M Zamani - Annual Reviews in Control, 2022 - Elsevier

Correct-by-construction synthesis is a cornerstone of the confluence of formal methods and
control theory towards designing safety-critical systems. Instead of following the time-tested …

被引用次数：50 相关文章所有 11 个版本

[PDF] jair.org Full View

Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

被引用次数：254 相关文章所有 10 个版本

[PDF] arxiv.org

A survey on interpretable reinforcement learning

C Glanois, P Weng, M Zimmer, D Li, T Yang, J Hao… - Machine Learning, 2024 - Springer

Although deep reinforcement learning has become a promising machine learning approach
for sequential decision-making problems, it is still not mature enough for high-stake domains …

被引用次数：99 相关文章所有 3 个版本

[PDF] neurips.cc

Compositional reinforcement learning from logical specifications

K Jothimurugan, S Bansal… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study the problem of learning control policies for complex tasks given by logical
specifications. Recent approaches automatically generate a reward function from a given …

被引用次数：98 相关文章所有 15 个版本

[PDF] neurips.cc

Learning reward machines for partially observable reinforcement learning

R Toro Icarte, E Waldie, T Klassen… - Advances in neural …, 2019 - proceedings.neurips.cc

Abstract Reward Machines (RMs), originally proposed for specifying problems in
Reinforcement Learning (RL), provide a structured, automata-based representation of a …

被引用次数：153 相关文章所有 9 个版本

[PDF] arxiv.org

Modular deep reinforcement learning for continuous motion planning with temporal logic

M Cai, M Hasanbeig, S Xiao, A Abate… - IEEE Robotics and …, 2021 - ieeexplore.ieee.org

This letter investigates the motion planning of autonomous dynamical systems modeled by
Markov decision processes (MDP) with unknown transition probabilities over continuous …

被引用次数：93 相关文章所有 14 个版本

[PDF] aaai.org

Symbolic plans as high-level instructions for reinforcement learning

L Illanes, X Yan, RT Icarte, SA McIlraith - Proceedings of the …, 2020 - ojs.aaai.org

Reinforcement learning (RL) agents seek to maximize the cumulative reward obtained when
interacting with their environment. Users define tasks or goals for RL agents by designing …

被引用次数：127 相关文章所有 11 个版本

[PDF] mlr.press

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press

We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

被引用次数：89 相关文章所有 9 个版本

[PDF] aaai.org

Compositional approach to translate LTLf/LDLf into deterministic finite automata

G De Giacomo, M Favorito - Proceedings of the International …, 2021 - ojs.aaai.org

The translation from temporal logics to automata is the workhorse algorithm of several
techniques in computer science and AI, such as reactive synthesis, reasoning about actions …

被引用次数：72 相关文章所有 11 个版本

[PDF] arxiv.org

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arXiv preprint arXiv:2304.12090, 2023 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …

被引用次数：9 相关文章所有 2 个版本

高级搜索

QQ 群