Z Mhammedi, DJ Foster… - … Conference on Machine …, 2023 - proceedings.mlr.press
We study the design of sample-efficient algorithms for reinforcement learning in the presence of rich, high-dimensional observations, formalized via the Block MDP problem …
A Agarwal, T Zhang - Advances in Neural Information …, 2022 - proceedings.neurips.cc
We propose a general framework to design posterior sampling methods for model-based RL. We show that the proposed algorithms can be analyzed by reducing regret to Hellinger …
In recent years, domains such as natural language processing and image recognition have popularized the paradigm of using large datasets to pretrain representations that can be …
E Minasyan, P Gradu… - Advances in Neural …, 2021 - proceedings.neurips.cc
We study online control of time-varying linear systems with unknown dynamics in the nonstochastic control model. At a high level, we demonstrate that this setting is\emph …
A Block, M Simchowitz… - Advances in Neural …, 2024 - proceedings.neurips.cc
The problem of piecewise affine (PWA) regression and planning is of foundational importance to the study of online learning, control, and robotics, where it provides a …
Y Sattar, S Oymak - Journal of Machine Learning Research, 2022 - jmlr.org
We consider the problem of learning a nonlinear dynamical system governed by a nonlinear state equation ht+ 1= ϕ (ht, ut; θ)+ wt. Here θ is the unknown system dynamics, ht is the …
Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press
We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …
A Agarwal, T Zhang - Conference on Learning Theory, 2022 - proceedings.mlr.press
Abstract Provably sample-efficient Reinforcement Learning (RL) with rich observations and function approximation has witnessed tremendous recent progress, particularly when the …
L Cui, T Basar, ZP Jiang - Learning for Dynamics and Control …, 2023 - proceedings.mlr.press
In this paper, we propose a robust reinforcement learning method for a class of linear discrete-time systems to handle model mismatches that may be induced by sim-to-real gap …