查看文章

neurips.cc 中的 [PDF]

Policy Optimization with Linear Temporal Logic Constraints

作者

Cameron Voloshin, Hoang Le, Swarat Chaudhuri, Yisong Yue

发表日期

2022/12/6

期刊

Advances in Neural Information Processing Systems

卷号

页码范围

17690-17702

简介

We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and an alternative to the standard of cost shaping. With access to a generative model, we develop a model-based approach that enjoys a sample complexity analysis for guaranteeing both task satisfaction and cost optimality (through a reduction to a reachability problem). Empirically, our algorithm can achieve strong performance even in low sample regimes.

引用总数

被引用次数：11

202320242 9

学术搜索中的文章

Policy optimization with linear temporal logic constraints

C Voloshin, H Le, S Chaudhuri, Y Yue - Advances in Neural Information Processing Systems, 2022

被引用次数：11 相关文章所有 12 个版本