作者
Xiao Li, Cristian-Ioan Vasile, Calin Belta
发表日期
2017/9/24
研讨会论文
2017 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
页码范围
3834-3839
出版商
IEEE
简介
Reinforcement learning (RL) depends critically on the choice of reward functions used to capture the desired behavior and constraints of a robot. Usually, these are handcrafted by a expert designer and represent heuristics for relatively simple tasks. Real world applications typically involve more complex tasks with rich temporal and logical structure. In this paper we take advantage of the expressive power of temporal logic (TL) to specify complex rules the robot should follow, and incorporate domain knowledge into learning. We propose Truncated Linear Temporal Logic (TLTL) as a specification language, We propose Truncated Linear Temporal Logic (TLTL) as a specification language, that is arguably well suited for the robotics applications, We show in simulated trials that learning is faster and policies obtained using the proposed approach outperform the ones learned using heuristic rewards in terms of the …
引用总数
20172018201920202021202220232024313232530475720
学术搜索中的文章
X Li, CI Vasile, C Belta - 2017 IEEE/RSJ International Conference on Intelligent …, 2017