作者
Balázs Varga, Balázs Kulcsár, Morteza Haghir Chehreghani
发表日期
2023/1/10
期刊
International Journal of Robust and Nonlinear Control
卷号
33
期号
1
页码范围
526-544
简介
This work aims at constructing a bridge between robust control theory and reinforcement learning. Although, reinforcement learning has shown admirable results in complex control tasks, the agent's learning behavior is opaque. Meanwhile, system theory has several tools for analyzing and controlling dynamical systems. This article places deep Q‐learning is into a control‐oriented perspective to study its learning dynamics with well‐established techniques from robust control. An uncertain linear time‐invariant model is formulated by means of the neural tangent kernel to describe learning. This novel approach allows giving conditions for stability (convergence) of the learning and enables the analysis of the agent's behavior in frequency‐domain. The control‐oriented approach makes it possible to formulate robust controllers that inject dynamical rewards as control input in the loss function to achieve better …
引用总数
学术搜索中的文章
B Varga, B Kulcsár, MH Chehreghani - International Journal of Robust and Nonlinear Control, 2023