Learning optimal feedback operators and their polynomial approximation

K Kunisch, D Vásquez-Varas, D Walter - arXiv preprint arXiv:2208.14120, 2022 - arxiv.org
A learning based method for obtaining feedback laws for nonlinear optimal control problems
is proposed. The learning problem is posed such that the open loop value function is its …

Learning optimal feedback operators and their sparse polynomial approximations

K Kunisch, D Vásquez-Varas, D Walter - Journal of Machine Learning …, 2023 - jmlr.org
A learning based method for obtaining feedback laws for nonlinear optimal control problems
is proposed. The learning problem is posed such that the open loop value function is its …

[HTML][HTML] Optimal polynomial feedback laws for finite horizon control problems

K Kunisch, D Vásquez-Varas - Computers & Mathematics with Applications, 2023 - Elsevier
A learning technique for finite horizon optimal control problems and its approximation based
on polynomials is analyzed. It allows to circumvent, in part, the curse dimensionality which is …

The Projected Bellman Equation in Reinforcement Learning

S Meyn - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org
Q-learning has become an important part of the reinforcement learning toolkit since its
introduction in the dissertation of Chris Watkins in the 1980 s. In the original tabular …

Neural Control Systems

P Colusso, D Filipović - arXiv preprint arXiv:2404.13967, 2024 - arxiv.org
We propose a function-learning methodology with a control-theoretical foundation. We
parametrise the approximating function as the solution to a control system on a reproducing …

[PDF][PDF] A convergent reinforcement learning algorithm in the continuous case based on a finite difference method

R Munos - IJCAI (2), 1997 - ri.cmu.edu
In this paper, we propose a convergent Reinforcement Learning algorithm for solving
optimal control problems for which the state space and the time are continuous variables …

[PDF][PDF] Active and passive learning in control processes

T Banek, E Kozłowski - Proc. 15th Int. Conf. System Science, II, 2004 - researchgate.net
We consider the optimal control problem for a discrete time stochastic system yi+ 1= f (ξ, yi,
ui)+ σ (ξ, yi) wi+ 1 where the ui are controls, wi are the system disturbances, and ξ represents …

G-Learning: Equivariant Indirect Optimal Control with Generating Function

T Lee - 2023 62nd IEEE Conference on Decision and Control …, 2023 - ieeexplore.ieee.org
This paper presents a new formulation of data-driven, learning-based optimal control with
the Hamilton-Jacobi theory. In contrast to the common practice of rein-forcement learning for …

[HTML][HTML] Imitation learning of stabilizing policies for nonlinear systems

S East - European Journal of Control, 2022 - Elsevier
There has been a recent interest in imitation learning methods that are guaranteed to
produce a stabilizing control law with respect to a known system. Work in this area has …

Direct training method for a continuous-time nonlinear optimal feedback controller

NJ Edwards, CJ Goh - Journal of optimization theory and applications, 1995 - Springer
The solutions of most nonlinear optimal control problems are given in the form of open-loop
optimal control which is computed from a given fixed initial condition. Optimal feedback …