sampling of states and using local trajectory optimizers to globally optimize a policy and
associated value function. This combination allows us to replace a dense multidimensional
grid with a much sparser adaptive sampling of states. Our focus is on finding steady-state
policies for deterministic, time-invariant, discrete-time control problems with continuous
states and actions, of the kind often found in robotics. In this paper we show that we can now solve …
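To make the combination concrete, the following is a minimal sketch, not the paper's implementation: it assumes a discrete-time double integrator with quadratic costs (all dynamics, cost weights, horizons, and sample counts are illustrative), uses a finite-horizon Riccati recursion in the role of the local trajectory optimizer, and replaces a dense grid with a sparse random sample of states whose optimized costs-to-go feed a nearest-neighbor value approximation. The names `local_trajectory_optimizer`, `v_hat`, and `policy` are hypothetical helpers introduced here for illustration.

```python
import numpy as np

# Minimal sketch (not the authors' method): sparse random state sampling
# combined with a local trajectory optimizer, on an assumed discrete-time
# double integrator with quadratic costs.

# Deterministic, time-invariant, discrete-time dynamics: x' = A x + B u
dt = 0.1
A = np.array([[1.0, dt], [0.0, 1.0]])
B = np.array([[0.0], [dt]])
Q = np.eye(2) * dt          # state cost (illustrative)
R = np.eye(1) * 0.1 * dt    # action cost (illustrative)

def local_trajectory_optimizer(x0, horizon=200):
    """Finite-horizon Riccati recursion: for this linear-quadratic toy
    problem it stands in for a local optimizer such as DDP, returning the
    optimized cost-to-go from x0 and the first action of the trajectory."""
    P = Q.copy()
    gains = []
    for _ in range(horizon):                     # backward pass
        K = np.linalg.solve(R + B.T @ P @ B, B.T @ P @ A)
        P = Q + A.T @ P @ (A - B @ K)
        gains.append(K)
    u0 = -gains[-1] @ x0                         # first action of the plan
    value = float(x0 @ P @ x0)                   # optimized cost-to-go
    return value, u0

# Sparse random sampling of states instead of a dense multidimensional grid.
rng = np.random.default_rng(0)
states = rng.uniform(-1.0, 1.0, size=(50, 2))
values = np.array([local_trajectory_optimizer(x)[0] for x in states])

def policy(x):
    """Greedy one-step lookahead against a nearest-neighbor value
    approximation built from the sparse state samples."""
    def v_hat(xq):                               # nearest-neighbor value
        return values[np.argmin(np.linalg.norm(states - xq, axis=1))]
    candidates = np.linspace(-2.0, 2.0, 41)
    costs = [x @ Q @ x + u * R[0, 0] * u + v_hat(A @ x + B.flatten() * u)
             for u in candidates]
    return candidates[int(np.argmin(costs))]

print(policy(np.array([0.5, -0.2])))
```

Because the sampled states each carry a locally optimized cost-to-go rather than a one-step backup, the value approximation can remain useful even when the samples are far sparser than a grid would require; the nearest-neighbor fit above is the simplest possible stand-in for the local models used with such samples.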