作者
Qinmin Yang, Jonathan Blake Vance, Sarangapani Jagannathan
发表日期
2008/8
期刊
IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics)
卷号
38
期号
4
页码范围
994-1001
出版商
IEEE
简介
A nonaffine discrete-time system represented by the nonlinear autoregressive moving average with eXogenous input (NARMAX) representation with unknown nonlinear system dynamics is considered. An equivalent affinelike representation in terms of the tracking error dynamics is first obtained from the original nonaffine nonlinear discrete-time system so that reinforcement-learning-based near-optimal neural network (NN) controller can be developed. The control scheme consists of two linearly parameterized NNs. One NN is designated as the critic NN, which approximates a predefined long-term cost function, and an action NN is employed to derive a near-optimal control signal for the system to track a desired trajectory while minimizing the cost function simultaneously. The NN weights are tuned online. By using the standard Lyapunov approach, the stability of the closed-loop system is shown. The net result is a …
引用总数
200920102011201220132014201520162017201820192020202120222023517111181155411321
学术搜索中的文章