Systematic generalisation through task temporal logic and deep reinforcement learning

RT Icarte, TQ Klassen, R Valenzano… - Journal of Artificial …, 2022 - jair.org

Reinforcement learning (RL) methods usually treat reward functions as black boxes. As
such, these methods must extensively interact with the environment in order to discover …

被引用次数：259 相关文章所有 10 个版本

[PDF] mlr.press

Ltl2action: Generalizing ltl instructions for multi-task rl

P Vaezipoor, AC Li, RAT Icarte… - … on Machine Learning, 2021 - proceedings.mlr.press

We address the problem of teaching a deep reinforcement learning (RL) agent to follow
instructions in multi-task environments. Instructions are expressed in a well-known formal …

被引用次数：89 相关文章所有 9 个版本

[PDF] arxiv.org

Reinforcement learning with knowledge representation and reasoning: A brief survey

C Yu, X Zheng, HH Zhuo, H Wan, W Luo - arXiv preprint arXiv:2304.12090, 2023 - arxiv.org

Reinforcement Learning (RL) has achieved tremendous development in recent years, but
still faces significant obstacles in addressing complex real-life problems due to the issues of …

被引用次数：9 相关文章所有 2 个版本

[PDF] github.io

Task-driven reinforcement learning with action primitives for long-horizon manipulation skills

H Wang, H Zhang, L Li, Z Kan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

It is an interesting open problem to enable robots to efficiently and effectively learn long-
horizon manipulation skills. Motivated to augment robot learning via more effective …

被引用次数：11 相关文章所有 3 个版本

[PDF] neurips.cc

Learning to follow instructions in text-based games

M Tuli, A Li, P Vaezipoor, T Klassen… - Advances in …, 2022 - proceedings.neurips.cc

Text-based games present a unique class of sequential decision making problem in which
agents interact with a partially observable, simulated environment via actions and …

被引用次数：14 相关文章所有 10 个版本

[PDF] arxiv.org

Lifelong reinforcement learning with temporal logic formulas and reward machines

X Zheng, C Yu, M Zhang - Knowledge-Based Systems, 2022 - Elsevier

Continuously learning new tasks using high-level ideas or knowledge is a key capability of
humans. In this paper, we propose lifelong reinforcement learning with sequential linear …

被引用次数：18 相关文章所有 6 个版本

[PDF] arxiv.org

Reinforcement Learning of Action and Query Policies with LTL Instructions under Uncertain Event Detector

W Hatanaka, R Yamashina… - IEEE Robotics and …, 2023 - ieeexplore.ieee.org

Reinforcement learning (RL) with linear temporal logic (LTL) objectives can allow robots to
carry out symbolic event plans in unknown environments. Most existing methods assume …

被引用次数：3 相关文章所有 3 个版本

[PDF] mlr.press

Hierarchies of reward machines

D Furelos-Blanco, M Law, A Jonsson… - International …, 2023 - proceedings.mlr.press

Reward machines (RMs) are a recent formalism for representing the reward function of a
reinforcement learning task through a finite-state machine whose edges encode subgoals of …

被引用次数：16 相关文章所有 9 个版本

[PDF] arxiv.org

Skill transfer for temporally-extended task specifications

JX Liu, A Shah, E Rosen, G Konidaris… - arXiv preprint arXiv …, 2022 - arxiv.org

Deploying robots in real-world domains, such as households and flexible manufacturing
lines, requires the robots to be taskable on demand. Linear temporal logic (LTL) is a widely …

被引用次数：14 相关文章

[PDF] easychair.org

Generalization of temporal logic tasks via future dependent options

D Xu, F Fekri - Machine Learning, 2024 - Springer

Temporal logic (TL) tasks consist of complex and temporally extended subgoals and they
are common for many real-world applications, such as service and navigation robots …

被引用次数：2 相关文章所有 2 个版本

高级搜索

QQ 群