作者
Ning Cao, Huaguang Zhang, Yanhong Luo, Dezhi Feng
发表日期
2012/9/1
期刊
International Journal of Systems Science
卷号
43
期号
9
页码范围
1673-1682
出版商
Taylor & Francis Group
简介
In this article, a novel iteration algorithm named two-stage approximate dynamic programming (TSADP) is proposed to seek the solution of nonlinear switched optimal control problem. At each iteration of TSADP, a multivariate optimal control problem is transformed to be a certain number of univariate optimal control problems. It is shown that the value function at each iteration can be characterised pointwisely by a set of smooth functions recursively obtained from TSADP, and the associated control policy, continuous control and switching control law included, is explicitly provided in a state-feedback form. Moreover, the convergence and optimality of TSADP is strictly proven. To implement this algorithm efficiently, neural networks, critic and action networks, are utilised to approximate the value function and continuous control law, respectively. Thus, the value function is expressed by the weights of critic networks …
引用总数
2013201420152016201720182019202020212022173231412