On the performance of planning through backpropagation

R Scaroni, TP Bueno, LN de Barros, D Mauá

Intelligent Systems: 9th Brazilian Conference, BRACIS 2020, Rio Grande, Brazil …, 2020 • Springer

Abstract

Planning problems with continuous state and action spaces are difficult to solve with existing planning techniques, especially when the state transition is defined by high-dimensional non-linear dynamics. Recently, a technique called Planning through Backpropagation (PtB) was introduced as an efficient and scalable alternative to traditional optimization-based methods for continuous planning problems. PtB leverages modern gradient descent algorithms and highly optimized automatic differentiation libraries to obtain approximate solutions. However, to date there have been no empirical evaluations of PtB on Linear-Quadratic (LQ) control problems. In this work, we compare PtB with LQR, an optimal algorithm from control theory, and its iterative version iLQR, when solving linear and non-linear continuous deterministic planning problems. The empirical results suggest that PtB can be an efficient alternative for optimizing non-linear continuous deterministic planning, and that it is much easier to implement and stabilize than classical model-predictive control methods.
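To make the comparison in the abstract concrete, here is a minimal sketch of the two ideas on a toy scalar linear-quadratic problem. It is not the authors' code. The dynamics `x[t+1] = a*x[t] + b*u[t]`, the cost weights, the horizon, the learning rate, and all function names are assumptions chosen for illustration. A hand-written reverse pass stands in for the automatic differentiation library that PtB would normally use, and the LQR baseline is the standard finite-horizon Riccati recursion.

```python
def rollout(x0, actions, a, b):
    """Simulate the assumed dynamics x[t+1] = a*x[t] + b*u[t]."""
    states = [x0]
    for u in actions:
        states.append(a * states[-1] + b * u)
    return states


def total_cost(states, actions, q, r):
    """Sum of stage costs q*x^2 + r*u^2 plus the terminal cost q*x_T^2."""
    running = sum(q * x * x + r * u * u for x, u in zip(states, actions))
    return running + q * states[-1] ** 2


def plan_gradient(states, actions, a, b, q, r):
    """Backpropagate the total cost through the rollout (manual reverse pass)."""
    lam = 2.0 * q * states[-1]          # d(cost)/d(x_T)
    grads = [0.0] * len(actions)
    for t in reversed(range(len(actions))):
        grads[t] = 2.0 * r * actions[t] + b * lam   # d(cost)/d(u_t)
        lam = 2.0 * q * states[t] + a * lam         # d(cost)/d(x_t)
    return grads


def ptb_plan(x0, horizon, a, b, q, r, lr=0.05, steps=2000):
    """Plain gradient descent on the open-loop action sequence."""
    actions = [0.0] * horizon
    for _ in range(steps):
        states = rollout(x0, actions, a, b)
        grads = plan_gradient(states, actions, a, b, q, r)
        actions = [u - lr * g for u, g in zip(actions, grads)]
    return actions


def lqr_optimal_cost(x0, horizon, a, b, q, r):
    """Backward Riccati recursion; returns the optimal cost P_0 * x0^2."""
    p = q
    for _ in range(horizon):
        k = (a * b * p) / (r + b * b * p)    # feedback gain, u = -k * x
        p = q + a * a * p - a * b * p * k    # cost-to-go coefficient
    return p * x0 * x0


# Illustrative constants (assumptions, not values from the paper).
A, B, Q, R, X0, T = 1.0, 0.5, 1.0, 0.1, 1.0, 10

plan = ptb_plan(X0, T, A, B, Q, R)
ptb_cost = total_cost(rollout(X0, plan, A, B), plan, Q, R)
lqr_cost = lqr_optimal_cost(X0, T, A, B, Q, R)
print(f"PtB cost: {ptb_cost:.6f}  LQR optimal cost: {lqr_cost:.6f}")
```

On a linear-quadratic problem the objective is convex in the action sequence, so gradient descent with a small enough step size should approach the LQR optimum. The non-linear case studied in the paper has no such guarantee, which is why an empirical comparison against iLQR is informative.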

Cited by: 2 · Related articles · All 3 versions
