On the performance of planning through backpropagation

R Scaroni, TP Bueno, LN de Barros, D Mauá - Intelligent Systems: 9th …, 2020 - Springer
Intelligent Systems: 9th Brazilian Conference, BRACIS 2020, Rio Grande, Brazil …, 2020Springer
Planning problems with continuous state and action spaces are difficult to solve with existing
planning techniques, specially when the state transition is defined by a high-dimension non-
linear dynamics. Recently, a technique called Planning through Backpropagation (PtB) was
introduced as an efficient and scalable alternative to traditional optimization-based methods
for continuous planning problems. PtB leverages modern gradient descent algorithms and
highly optimized automatic differentiation libraries to obtain approximate solutions. However …
Abstract
Planning problems with continuous state and action spaces are difficult to solve with existing planning techniques, specially when the state transition is defined by a high-dimension non-linear dynamics. Recently, a technique called Planning through Backpropagation (PtB) was introduced as an efficient and scalable alternative to traditional optimization-based methods for continuous planning problems. PtB leverages modern gradient descent algorithms and highly optimized automatic differentiation libraries to obtain approximate solutions. However, to date there have been no empirical evaluations comparing PtB with Linear-Quadratic (LQ) control problems. In this work, we compare PtB with an optimal algorithm from control theory called LQR, and its iterative version iLQR, when solving linear and non-linear continuous deterministic planning problems. The empirical results suggest that PtB can be an efficient alternative to optimizing non-linear continuous deterministic planning, being much easier to be implemented and stabilized than classical model-predictive control methods.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果