相关文章- 学术资源搜索

A survey on policy search algorithms for learning robot controllers in a handful of trials

K Chatzilygeroudis, V Vassiliades… - IEEE Transactions …, 2019 - ieeexplore.ieee.org

Most policy search (PS) algorithms require thousands of training episodes to find an
effective policy, which is often infeasible with a physical robot. This survey article focuses on …

被引用次数：187 相关文章所有 17 个版本

[PDF] nowpublishers.com

A survey on policy search for robotics

MP Deisenroth, G Neumann… - Foundations and Trends …, 2013 - nowpublishers.com

Policy search is a subfield in reinforcement learning which focuses on finding good
parameters for a given policy parametrization. It is well suited for robotics as it can cope with …

被引用次数：1237 相关文章所有 16 个版本

[PDF] arxiv.org

Black-box data-efficient policy search for robotics

K Chatzilygeroudis, R Rama, R Kaushik… - 2017 IEEE/RSJ …, 2017 - ieeexplore.ieee.org

The most data-efficient algorithms for reinforcement learning (RL) in robotics are based on
uncertain dynamical models: after each episode, they first learn a dynamical model of the …

被引用次数：131 相关文章所有 10 个版本

[HTML] sciencedirect.com

[HTML][HTML] Model-based contextual policy search for data-efficient generalization of robot skills

A Kupcsik, MP Deisenroth, J Peters, AP Loh… - Artificial Intelligence, 2017 - Elsevier

In robotics, lower-level controllers are typically used to make the robot solve a specific task
in a fixed context. For example, the lower-level controller can encode a hitting movement …

被引用次数：100 相关文章所有 16 个版本

[PDF] mlr.press

Domain randomization for simulation-based policy optimization with transferability assessment

F Muratore, F Treede, M Gienger… - Conference on Robot …, 2018 - proceedings.mlr.press

Exploration-based reinforcement learning on real robot systems is generally time-intensive
and can lead to catastrophic robot failures. Therefore, simulation-based policy search …

被引用次数：63 相关文章所有 7 个版本

[PDF] aaai.org

Data-efficient generalization of robot skills with contextual policy search

A Kupcsik, M Deisenroth, J Peters… - Proceedings of the AAAI …, 2013 - ojs.aaai.org

In robotics, controllers make the robot solve a task within a specific context. The context can
describe the objectives of the robot or physical properties of the environment and is always …

被引用次数：144 相关文章所有 16 个版本

[PDF] arxiv.org

Simple random search provides a competitive approach to reinforcement learning

H Mania, A Guy, B Recht - arXiv preprint arXiv:1803.07055, 2018 - arxiv.org

A common belief in model-free reinforcement learning is that methods based on random
search in the parameter space of policies exhibit significantly worse sample complexity than …

被引用次数：368 相关文章所有 2 个版本

[PDF] ieee.org

Gaussian processes for data-efficient learning in robotics and control

MP Deisenroth, D Fox… - IEEE transactions on …, 2013 - ieeexplore.ieee.org

Autonomous learning has been a promising direction in control and robotics for more than a
decade since data-driven learning allows to reduce the amount of engineering knowledge …

被引用次数：835 相关文章所有 26 个版本

[PDF] degruyter.com Full View

Robot skill learning: From reinforcement learning to evolution strategies

F Stulp, O Sigaud - Paladyn, Journal of Behavioral Robotics, 2013 - degruyter.com

Policy improvement methods seek to optimize the parameters of a policy with respect to a
utility function. Owing to current trends involving searching in parameter space (rather than …

被引用次数：172 相关文章所有 9 个版本

Real-world reinforcement learning via multifidelity simulators

M Cutler, TJ Walsh, JP How - IEEE Transactions on Robotics, 2015 - ieeexplore.ieee.org

Reinforcement learning (RL) can be a tool for designing policies and controllers for robotic
systems. However, the cost of real-world samples remains prohibitive as many RL …

被引用次数：80 相关文章所有 4 个版本

高级搜索

QQ 群