A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org
Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

A survey on policy search for robotics

MP Deisenroth, G Neumann… - Foundations and Trends …, 2013 - nowpublishers.com
Policy search is a subfield in reinforcement learning which focuses on finding good
parameters for a given policy parametrization. It is well suited for robotics as it can cope with …

Black-box data-efficient policy search for robotics

K Chatzilygeroudis, R Rama, R Kaushik… - 2017 IEEE/RSJ …, 2017 - ieeexplore.ieee.org
The most data-efficient algorithms for reinforcement learning (RL) in robotics are based on
uncertain dynamical models: after each episode, they first learn a dynamical model of the …

[HTML][HTML] Model-based contextual policy search for data-efficient generalization of robot skills

A Kupcsik, MP Deisenroth, J Peters, AP Loh… - Artificial Intelligence, 2017 - Elsevier
In robotics, lower-level controllers are typically used to make the robot solve a specific task
in a fixed context. For example, the lower-level controller can encode a hitting movement …

Domain randomization for simulation-based policy optimization with transferability assessment

F Muratore, F Treede, M Gienger… - Conference on Robot …, 2018 - proceedings.mlr.press
Exploration-based reinforcement learning on real robot systems is generally time-intensive
and can lead to catastrophic robot failures. Therefore, simulation-based policy search …

Data-efficient generalization of robot skills with contextual policy search

A Kupcsik, M Deisenroth, J Peters… - Proceedings of the AAAI …, 2013 - ojs.aaai.org
In robotics, controllers make the robot solve a task within a specific context. The context can
describe the objectives of the robot or physical properties of the environment and is always …

Simple random search provides a competitive approach to reinforcement learning

H Mania, A Guy, B Recht - arXiv preprint arXiv:1803.07055, 2018 - arxiv.org
A common belief in model-free reinforcement learning is that methods based on random
search in the parameter space of policies exhibit significantly worse sample complexity than …

Gaussian processes for data-efficient learning in robotics and control

MP Deisenroth, D Fox… - IEEE transactions on …, 2013 - ieeexplore.ieee.org
Autonomous learning has been a promising direction in control and robotics for more than a
decade since data-driven learning allows to reduce the amount of engineering knowledge …

Robot skill learning: From reinforcement learning to evolution strategies

F Stulp, O Sigaud - Paladyn, Journal of Behavioral Robotics, 2013 - degruyter.com
Policy improvement methods seek to optimize the parameters of a policy with respect to a
utility function. Owing to current trends involving searching in parameter space (rather than …

Real-world reinforcement learning via multifidelity simulators

M Cutler, TJ Walsh, JP How - IEEE Transactions on Robotics, 2015 - ieeexplore.ieee.org
Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic
systems. However, the cost of real-world samples remains prohibitive as many RL …