Reward machines: Exploiting reward function structure in reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org
Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press
We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arXiv preprint arXiv:2304.12090, 2023 - arxiv.org
Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …

Task-driven reinforcement learning with action primitives for long-horizon manipulation skills

H Wang, H Zhang, L Li, Z Kan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
It is an interesting open problem to enable robots to efficiently and effectively learn long-
horizon manipulation skills. Motivated to augment robot learning via more effective …

Learning to follow instructions in text-based games

M Tuli, A Li, P Vaezipoor, T Klassen… - Advances in …, 2022 - proceedings.neurips.cc
Text-based games present a unique class of sequential decision making problem in which
agents interact with a partially observable, simulated environment via actions and …

Lifelong reinforcement learning with temporal logic formulas and reward machines

X Zheng, C Yu, M Zhang - Knowledge-Based Systems, 2022 - Elsevier
Continuously learning new tasks using high-level ideas or knowledge is a key capability of
humans. In this paper, we propose lifelong reinforcement learning with sequential linear …

Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

W Hatanaka, R Yamashina… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org
Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to
carry out symbolic event plans in unknown environments. Most existing methods assume …

Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press
Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

Skill transfer for temporally-extended task specifications

JX Liu, A Shah, E Rosen, G Konidaris… - arXiv preprint arXiv …, 2022 - arxiv.org
Deploying robots in real-world domains, such as households and flexible manufacturing
lines, requires the robots to be taskable on demand. Linear temporal logic (LTL) is a widely …

Generalization of temporal logic tasks via future dependent options

D Xu, F Fekri - Machine Learning, 2024 - Springer
Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …