相关文章- 学术资源搜索

Learning optimal feedback operators and their polynomial approximation

K Kunisch, D Vásquez-Varas, D Walter - arXiv preprint arXiv:2208.14120, 2022 - arxiv.org

A learning based method for obtaining feedback laws for nonlinear optimal control problems
is proposed. The learning problem is posed such that the open loop value function is its …

被引用次数：3 相关文章所有 2 个版本

[PDF] jmlr.org

Learning optimal feedback operators and their sparse polynomial approximations

K Kunisch, D Vásquez-Varas, D Walter - Journal of Machine Learning …, 2023 - jmlr.org

A learning based method for obtaining feedback laws for nonlinear optimal control problems
is proposed. The learning problem is posed such that the open loop value function is its …

被引用次数：4 相关文章所有 2 个版本

[HTML] sciencedirect.com

[HTML][HTML] Optimal polynomial feedback laws for finite horizon control problems

K Kunisch, D Vásquez-Varas - Computers & Mathematics with Applications, 2023 - Elsevier

A learning technique for finite horizon optimal control problems and its approximation based
on polynomials is analyzed. It allows to circumvent, in part, the curse dimensionality which is …

被引用次数：2 相关文章所有 4 个版本

The Projected Bellman Equation in Reinforcement Learning

S Meyn - IEEE Transactions on Automatic Control, 2024 - ieeexplore.ieee.org

Q-learning has become an important part of the reinforcement learning toolkit since its
introduction in the dissertation of Chris Watkins in the 1980 s. In the original tabular …

[PDF] arxiv.org

Neural Control Systems

P Colusso, D Filipović - arXiv preprint arXiv:2404.13967, 2024 - arxiv.org

We propose a function-learning methodology with a control-theoretical foundation. We
parametrise the approximating function as the solution to a control system on a reproducing …

[PDF][PDF] A convergent reinforcement learning algorithm in the continuous case based on a finite difference method

R Munos - IJCAI (2), 1997 - ri.cmu.edu

In this paper, we propose a convergent Reinforcement Learning algorithm for solving
optimal control problems for which the state space and the time are continuous variables …

被引用次数：26 相关文章所有 8 个版本

[PDF] researchgate.net

[PDF][PDF] Active and passive learning in control processes

T Banek, E Kozłowski - Proc. 15th Int. Conf. System Science, II, 2004 - researchgate.net

We consider the optimal control problem for a discrete time stochastic system yi+ 1= f (ξ, yi,
ui)+ σ (ξ, yi) wi+ 1 where the ui are controls, wi are the system disturbances, and ξ represents …

被引用次数：4 相关文章所有 2 个版本

G-Learning: Equivariant Indirect Optimal Control with Generating Function

T Lee - 2023 62nd IEEE Conference on Decision and Control …, 2023 - ieeexplore.ieee.org

This paper presents a new formulation of data-driven, learning-based optimal control with
the Hamilton-Jacobi theory. In contrast to the common practice of rein-forcement learning for …

[HTML][HTML] Imitation learning of stabilizing policies for nonlinear systems

S East - European Journal of Control, 2022 - Elsevier

There has been a recent interest in imitation learning methods that are guaranteed to
produce a stabilizing control law with respect to a known system. Work in this area has …

被引用次数：2 相关文章所有 5 个版本

[PDF] psu.edu

Direct training method for a continuous-time nonlinear optimal feedback controller

NJ Edwards, CJ Goh - Journal of optimization theory and applications, 1995 - Springer

The solutions of most nonlinear optimal control problems are given in the form of open-loop
optimal control which is computed from a given fixed initial condition. Optimal feedback …

被引用次数：32 相关文章所有 6 个版本

高级搜索

QQ 群