Towards provable control for unknown linear dynamical systems

Z Allen-Zhu, Y Li, Z Song - International conference on …, 2019 - proceedings.mlr.press

Deep neural networks (DNNs) have demonstrated dominating performance in many fields;
since AlexNet, networks used in practice are going wider and deeper. On the theoretical …

被引用次数：1589 相关文章所有 5 个版本

[PDF] arxiv.org

Non-asymptotic identification of lti systems from a single trajectory

S Oymak, N Ozay - 2019 American control conference (ACC), 2019 - ieeexplore.ieee.org

We consider the problem of learning a realization for a linear time-invariant (LTI) dynamical
system from input/output data. Given a single input/output trajectory, we provide finite time …

被引用次数：254 相关文章所有 4 个版本

[PDF] mlr.press

Improper learning for non-stochastic control

M Simchowitz, K Singh… - Conference on Learning …, 2020 - proceedings.mlr.press

We consider the problem of controlling a possibly unknown linear dynamical system with
adversarial perturbations, adversarially chosen convex loss functions, and partially …

被引用次数：156 相关文章所有 5 个版本

[PDF] mlr.press

The nonstochastic control problem

E Hazan, S Kakade, K Singh - Algorithmic Learning Theory, 2020 - proceedings.mlr.press

We consider the problem of controlling an unknown linear dynamical system in the presence
of (nonstochastic) adversarial perturbations and adversarial convex loss functions. In …

被引用次数：125 相关文章所有 9 个版本

[PDF] mlr.press

Provable reinforcement learning with a short-term memory

Y Efroni, C Jin, A Krishnamurthy… - … on Machine Learning, 2022 - proceedings.mlr.press

Real-world sequential decision making problems commonly involve partial observability,
which requires the agent to maintain a memory of history in order to infer the latent states …

被引用次数：41 相关文章所有 4 个版本

[PDF] mlr.press

Online linear quadratic control

A Cohen, A Hasidim, T Koren, N Lazic… - International …, 2018 - proceedings.mlr.press

We study the problem of controlling linear time-invariant systems with known noisy dynamics
and adversarially chosen quadratic losses. We present the first efficient online learning …

被引用次数：154 相关文章所有 6 个版本

[PDF] nsf.gov

Revisiting ho–kalman-based system identification: Robustness and finite-sample analysis

S Oymak, N Ozay - IEEE Transactions on Automatic Control, 2021 - ieeexplore.ieee.org

Weconsider the problem of learning a realization for a linear time-invariant (LTI) dynamical
system from input/output data. Given a single input/output trajectory, we provide finite time …

被引用次数：53 相关文章所有 2 个版本

[PDF] mlr.press

Model-free linear quadratic control via reduction to expert prediction

Y Abbasi-Yadkori, N Lazic… - The 22nd International …, 2019 - proceedings.mlr.press

Abstract Model-free approaches for reinforcement learning (RL) and continuous control find
policies based only on past states and rewards, without fitting a model of the system …

被引用次数：96 相关文章所有 9 个版本

[HTML] diva-portal.org

Finite-time identification of linear systems: Fundamental limits and optimal algorithms

Y Jedra, A Proutiere - IEEE Transactions on Automatic Control, 2022 - ieeexplore.ieee.org

We investigate the linear system identification problem in the so-called fixed budget and
fixed confidence settings. In the fixed budget setting, the learner aims at estimating the state …

被引用次数：16 相关文章所有 2 个版本

[PDF] arxiv.org

Sample complexity lower bounds for linear system identification

Y Jedra, A Proutiere - 2019 IEEE 58th Conference on Decision …, 2019 - ieeexplore.ieee.org

This paper establishes problem-specific sample complexity lower bounds for linear system
identification problems. The sample complexity is defined in the PAC framework: it …

被引用次数：52 相关文章所有 5 个版本

高级搜索

QQ 群