Policy optimization with linear temporal logic constraints

M Perez, F Somenzi, A Trivedi - … of the AAAI Conference on Artificial …, 2024 - ojs.aaai.org

Linear temporal logic (LTL) and omega-regular objectives---a superset of LTL---have seen
recent use as a way to express non-Markovian objectives in reinforcement learning. We …

被引用次数：4 相关文章所有 3 个版本

[PDF] mlr.press

Eventual discounting temporal logic counterfactual experience replay

C Voloshin, A Verma, Y Yue - International Conference on …, 2023 - proceedings.mlr.press

Linear temporal logic (LTL) offers a simplified way of specifying tasks for policy optimization
that may otherwise be difficult to describe with scalar reward functions. However, the …

被引用次数：5 相关文章所有 12 个版本

[PDF] arxiv.org

Deep Policy Optimization with Temporal Logic Constraints

A Shah, C Voloshin, C Yang, A Verma… - arXiv preprint arXiv …, 2024 - arxiv.org

Temporal logics, such as linear temporal logic (LTL), offer a precise means of specifying
tasks for (deep) reinforcement learning (RL) agents. In our work, we consider the setting …

被引用次数：2 相关文章所有 2 个版本

[PDF] arxiv.org

Using experience classification for training non-Markovian tasks

R Miao, X Lu, C Tian, B Yu, J Cui, Z Duan - Expert Systems with …, 2024 - Elsevier

Abstract Unlike standard Reinforcement Learning (RL) model, many real-world tasks are
non-Markovian, which requires long-term memory and dependency. Hence solving a non …

Directed Exploration in Reinforcement Learning from Linear Temporal Logic

M Bagatella, A Krause, G Martius - arXiv preprint arXiv:2408.09495, 2024 - arxiv.org

Linear temporal logic (LTL) is a powerful language for task specification in reinforcement
learning, as it allows describing objectives beyond the expressivity of conventional …

[PDF] arxiv.org

高级搜索

QQ 群