Random sampling of states in dynamic programming

S Levine, V Koltun - International conference on machine …, 2013 - proceedings.mlr.press

Direct policy search can effectively scale to high-dimensional systems, but complex policies
with hundreds of parameters often present a challenge for such methods, requiring …

被引用次数：1397 相关文章所有 27 个版本

[PDF] berkeley.edu

LQR-trees: Feedback motion planning via sums-of-squares verification

R Tedrake, IR Manchester… - … Journal of Robotics …, 2010 - journals.sagepub.com

Advances in the direct computation of Lyapunov functions using convex optimization make it
possible to efficiently evaluate regions of attraction for smooth non-linear systems. Here we …

被引用次数：615 相关文章所有 11 个版本

[PDF] cmu.edu

Learning control in robotics

S Schaal, CG Atkeson - IEEE Robotics & Automation Magazine, 2010 - ieeexplore.ieee.org

Recent trends in robot learning are to use trajectory-based optimal control techniques and
reinforcement learning to scale complex robotic systems. On the one hand, increased …

被引用次数：227 相关文章所有 10 个版本

[PDF] mit.edu

LQR-trees: Feedback motion planning on sparse randomized trees.

R Tedrake - Robotics: Science and Systems, 2009 - direct.mit.edu

Recent advances in the direct computation of Lyapunov functions using convex optimization
make it possible to efficiently evaluate regions of stability for smooth nonlinear systems …

被引用次数：234 相关文章所有 14 个版本

[PDF] ubc.ca

Robust task-based control policies for physics-based characters

S Coros, P Beaudoin, M Van de Panne - ACM SIGGRAPH Asia 2009 …, 2009 - dl.acm.org

We present a method for precomputing robust task-based control policies for physically
simulated characters. This allows for characters that can demonstrate skill and purpose in …

被引用次数：167 相关文章所有 9 个版本

[PDF] researchgate.net

[图书][B] Neural approximations for optimal control and decision

R Zoppoli, M Sanguineti, G Gnecco, T Parisini - 2020 - Springer

Many scientific and technological areas of major interest require one to solve infinite-
dimensional optimization problems, also called functional optimization problems. In such a …

被引用次数：67 相关文章所有 9 个版本

[PDF] sjtu.edu.cn

Standing balance control using a trajectory library

C Liu, CG Atkeson - 2009 IEEE/RSJ International Conference …, 2009 - ieeexplore.ieee.org

This paper presents a standing balance controller that explicitly handles pushes. We employ
a library of optimal trajectories and the neighboring optimal control method to generate local …

被引用次数：125 相关文章所有 12 个版本

[PDF] cmu.edu

Multiple balance strategies from one optimization criterion

CG Atkeson, B Stephens - 2007 7th IEEE-RAS International …, 2007 - ieeexplore.ieee.org

Multiple strategies for standing balance have been observed in humans, including using the
ankles to apply torque to the ground, using the hips and/or arms to generate horizontal …

被引用次数：121 相关文章所有 11 个版本

[PDF] uliege.be

Cross-entropy optimization of control policies with adaptive basis functions

L Busoniu, D Ernst, B De Schutter… - IEEE Transactions on …, 2010 - ieeexplore.ieee.org

This paper introduces an algorithm for direct search of control policies in continuous-state
discrete-action Markov decision processes. The algorithm looks for the best closed-loop …

被引用次数：98 相关文章所有 17 个版本

[PDF] nsf.gov

Min-max differential dynamic programming: Continuous and discrete time formulations

W Sun, Y Pan, J Lim, EA Theodorou… - Journal of Guidance …, 2018 - arc.aiaa.org

In this work, the first min-max Game-Theoretic Differential Dynamic Programming (GT-DDP)
algorithm in continuous time is derived. A set of backward differential equations for the value …

被引用次数：34 相关文章所有 5 个版本

高级搜索

QQ 群