An algorithmic perspective on imitation learning

T Osa, J Pajarinen, G Neumann… - … and Trends® in …, 2018 - nowpublishers.com
Foundations and Trends® in Robotics, 2018nowpublishers.com
As robots and other intelligent agents move from simple environments and problems to more
complex, unstructured settings, manually programming their behavior has become
increasingly challenging and expensive. Often, it is easier for a teacher to demonstrate a
desired behavior rather than attempt to manually engineer it. This process of learning from
demonstrations, and the study of algorithms to do so, is called imitation learning. This work
provides an introduction to imitation learning. It covers the underlying assumptions …
Abstract
As robots and other intelligent agents move from simple environments and problems to more complex, unstructured settings, manually programming their behavior has become increasingly challenging and expensive. Often, it is easier for a teacher to demonstrate a desired behavior rather than attempt to manually engineer it. This process of learning from demonstrations, and the study of algorithms to do so, is called imitation learning. This work provides an introduction to imitation learning. It covers the underlying assumptions, approaches, and how they relate; the rich set of algorithms developed to tackle the problem; and advice on effective tools and implementation. We intend this paper to serve two audiences. First, we want to familiarize machine learning experts with the challenges of imitation learning, particularly those arising in robotics, and the interesting theoretical and practical distinctions between it and more familiar frameworks like statistical supervised learning theory and reinforcement learning. Second, we want to give roboticists and experts in applied artificial intelligence a broader appreciation for the frameworks and tools available for imitation learning. We pay particular attention to the intimate connection between imitation learning approaches and those of structured prediction Daumé III et al.[2009]. To structure this discussion, we categorize imitation learning techniques based on the following key criteria which drive algorithmic decisions:
nowpublishers.com
以上显示的是最相近的搜索结果。 查看全部搜索结果