Learning the linear quadratic regulator from nonlinear observations

Z Mhammedi, A Block, DJ Foster… - Advances in Neural …, 2024 - proceedings.neurips.cc

A major challenge in reinforcement learning is to develop practical, sample-efficient
algorithms for exploration in high-dimensional domains where generalization and function …

Cited by 16 · Related articles · All 5 versions

[PDF] mlr.press

Representation learning with multi-step inverse kinematics: An efficient and optimal approach to rich-observation RL

Z Mhammedi, DJ Foster… - … Conference on Machine …, 2023 - proceedings.mlr.press

We study the design of sample-efficient algorithms for reinforcement learning in the
presence of rich, high-dimensional observations, formalized via the Block MDP problem …

Cited by 21 · Related articles · All 6 versions

[PDF] neurips.cc

Model-based RL with optimistic posterior sampling: Structural conditions and sample complexity

A Agarwal, T Zhang - Advances in Neural Information …, 2022 - proceedings.neurips.cc

We propose a general framework to design posterior sampling methods for model-based
RL. We show that the proposed algorithms can be analyzed by reducing regret to Hellinger …

Cited by 37 · Related articles · All 9 versions

[PDF] neurips.cc

Inverse dynamics pretraining learns good representations for multitask imitation

D Brandfonbrener, O Nachum… - Advances in Neural …, 2024 - proceedings.neurips.cc

In recent years, domains such as natural language processing and image recognition have
popularized the paradigm of using large datasets to pretrain representations that can be …

Cited by 16 · Related articles · All 7 versions

[PDF] neurips.cc

Online control of unknown time-varying dynamical systems

E Minasyan, P Gradu… - Advances in Neural …, 2021 - proceedings.neurips.cc

We study online control of time-varying linear systems with unknown dynamics in the
nonstochastic control model. At a high level, we demonstrate that this setting is …

Cited by 33 · Related articles · All 7 versions

[PDF] neurips.cc

Smoothed online learning for prediction in piecewise affine systems

A Block, M Simchowitz… - Advances in Neural …, 2024 - proceedings.neurips.cc

The problem of piecewise affine (PWA) regression and planning is of foundational
importance to the study of online learning, control, and robotics, where it provides a …

Cited by 10 · Related articles · All 5 versions

[PDF] jmlr.org

Non-asymptotic and accurate learning of nonlinear dynamical systems

Y Sattar, S Oymak - Journal of Machine Learning Research, 2022 - jmlr.org

We consider the problem of learning a nonlinear dynamical system governed by a nonlinear
state equation h_{t+1} = ϕ(h_t, u_t; θ) + w_t. Here θ is the unknown system dynamics, h_t is the …

Cited by 66 · Related articles · All 5 versions

[PDF] mlr.press

Learning mixtures of linear dynamical systems

Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press

We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from
unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …

Cited by 23 · Related articles · All 5 versions

[PDF] mlr.press

Non-linear reinforcement learning in large action spaces: Structural conditions and sample-efficiency of posterior sampling

A Agarwal, T Zhang - Conference on Learning Theory, 2022 - proceedings.mlr.press

Provably sample-efficient Reinforcement Learning (RL) with rich observations and
function approximation has witnessed tremendous recent progress, particularly when the …

Cited by 18 · Related articles · All 5 versions

[PDF] mlr.press

A reinforcement learning look at risk-sensitive linear quadratic Gaussian control

L Cui, T Basar, ZP Jiang - Learning for Dynamics and Control …, 2023 - proceedings.mlr.press

In this paper, we propose a robust reinforcement learning method for a class of linear
discrete-time systems to handle model mismatches that may be induced by sim-to-real gap …

Cited by 14 · Related articles · All 7 versions
