相关文章- 学术资源搜索

Identification and adaptive control of markov jump systems: Sample complexity and regret bounds

Y Sattar, Z Du, DA Tarzanagh, L Balzano… - arXiv preprint arXiv …, 2021 - arxiv.org

Learning how to effectively control unknown dynamical systems is crucial for intelligent
autonomous systems. This task becomes a significant challenge when the underlying …

被引用次数：23 相关文章所有 6 个版本

[PDF] nsf.gov

Data-driven control of markov jump systems: Sample complexity and regret bounds

Z Du, Y Sattar, DA Tarzanagh, L Balzano… - 2022 American …, 2022 - ieeexplore.ieee.org

Learning how to effectively control unknown dynamical systems from data is crucial for
intelligent autonomous systems. This task becomes a significant challenge when the …

被引用次数：7 相关文章所有 5 个版本

[PDF] arxiv.org

A general framework for learning-based distributionally robust MPC of Markov jump systems

M Schuurmans, P Patrinos - IEEE Transactions on Automatic …, 2023 - ieeexplore.ieee.org

In this article, we present a data-driven learning model predictive control (MPC) scheme for
chance-constrained Markov jump systems with unknown switching probabilities. Using …

被引用次数：21 相关文章所有 4 个版本

[PDF] mlr.press

Policy learning of MDPs with mixed continuous/discrete variables: A case study on model-free control of Markovian jump systems

JP Jansch-Porto, B Hu… - Learning for Dynamics …, 2020 - proceedings.mlr.press

Markovian jump linear systems (MJLS) are an important class of dynamical systems that
arise in many control applications. In this paper, we introduce the problem of controlling …

被引用次数：21 相关文章所有 7 个版本

[PDF] arxiv.org

Certainty equivalent quadratic control for markov jump systems

Z Du, Y Sattar, DA Tarzanagh, L Balzano… - arXiv preprint arXiv …, 2021 - arxiv.org

Real-world control applications often involve complex dynamics subject to abrupt changes
or variations. Markov jump linear systems (MJS) provide a rich framework for modeling such …

被引用次数：10 相关文章所有 4 个版本

[PDF] mlr.press

Thompson Sampling Achieves Regret in Linear Quadratic Control

T Kargin, S Lale, K Azizzadenesheli… - … on Learning Theory, 2022 - proceedings.mlr.press

Thompson Sampling (TS) is an efficient method for decision-making under uncertainty,
where an action is sampled from a carefully prescribed distribution which is updated based …

被引用次数：14 相关文章所有 7 个版本

Learning-based stabilization of Markov jump linear systems

JJR Liu, M Ogura, Q Li, J Lam - Neurocomputing, 2024 - Elsevier

In this paper, we explore the stabilization problem of discrete-time Markov jump linear
systems from a new perspective. We establish a novel learning-based framework that …

被引用次数：1 相关文章

[PDF] arxiv.org

Convergence guarantees of policy optimization methods for Markovian jump linear systems

JP Jansch-Porto, B Hu… - 2020 American Control …, 2020 - ieeexplore.ieee.org

Recently, policy optimization for control purposes has received renewed attention due to the
increasing interest in reinforcement learning. In this paper, we investigate the convergence …

被引用次数：39 相关文章所有 5 个版本

[PDF] arxiv.org

Regret lower bounds for learning linear quadratic gaussian systems

I Ziemann, H Sandberg - arXiv preprint arXiv:2201.01680, 2022 - arxiv.org

TWe establish regret lower bounds for adaptively controlling an unknown linear Gaussian
system with quadratic costs. We combine ideas from experiment design, estimation theory …

被引用次数：18 相关文章所有 2 个版本

[PDF] arxiv.org

Data-driven distributionally robust control of partially observable jump linear systems

M Schuurmans, P Patrinos - 2021 60th IEEE Conference on …, 2021 - ieeexplore.ieee.org

We study data-driven control of (Markov) jump linear systems with unknown transition
probabilities, where both the discrete mode and the continuous state are to be inferred from …

被引用次数：7 相关文章所有 4 个版本

高级搜索

QQ 群

Identification and adaptive control of markov jump systems: Sample complexity and regret bounds

Data-driven control of markov jump systems: Sample complexity and regret bounds

A general framework for learning-based distributionally robust MPC of Markov jump systems

Policy learning of MDPs with mixed continuous/discrete variables: A case study on model-free control of Markovian jump systems

Certainty equivalent quadratic control for markov jump systems

Thompson Sampling Achieves Regret in Linear Quadratic Control

Learning-based stabilization of Markov jump linear systems

Convergence guarantees of policy optimization methods for Markovian jump linear systems

Regret lower bounds for learning linear quadratic gaussian systems

Data-driven distributionally robust control of partially observable jump linear systems

相关搜索

引用