Breaking the sample size barrier in model-based reinforcement learning with a generative model

G Li, Y Wei, Y Chi, Y Gu… - Advances in neural …, 2020 - proceedings.neurips.cc
We investigate the sample efficiency of reinforcement learning in a $\gamma $-discounted
infinite-horizon Markov decision process (MDP) with state space S and action space A …

The power of preconditioning in overparameterized low-rank matrix sensing

X Xu, Y Shen, Y Chi, C Ma - International Conference on …, 2023 - proceedings.mlr.press
Abstract We propose $\textsf {ScaledGD ($\lambda $)} $, a preconditioned gradient descent
method to tackle the low-rank matrix sensing problem when the true rank is unknown, and …

Approximate message passing from random initialization with applications to Z2 synchronization

G Li, W Fan, Y Wei - … of the National Academy of Sciences, 2023 - National Acad Sciences
This paper is concerned with the problem of reconstructing an unknown rank-one matrix with
prior structural information from noisy observations. While computing the Bayes optimal …

A theory of non-linear feature learning with one gradient step in two-layer neural networks

B Moniri, D Lee, H Hassani, E Dobriban - arXiv preprint arXiv:2310.07891, 2023 - arxiv.org
Feature learning is thought to be one of the fundamental reasons for the success of deep
neural networks. It is rigorously known that in two-layer fully-connected neural networks …

A non-asymptotic framework for approximate message passing in spiked models

G Li, Y Wei - arXiv preprint arXiv:2208.03313, 2022 - arxiv.org
Approximate message passing (AMP) emerges as an effective iterative paradigm for solving
high-dimensional statistical problems. However, prior AMP theory--which focused mostly on …

[HTML][HTML] Bridging convex and nonconvex optimization in robust PCA: Noise, outliers, and missing data

Y Chen, J Fan, C Ma, Y Yan - Annals of statistics, 2021 - ncbi.nlm.nih.gov
This paper delivers improved theoretical guarantees for the convex programming approach
in low-rank matrix estimation, in the presence of (1) random noise,(2) gross sparse outliers …

Scaling and scalability: Provable nonconvex low-rank tensor estimation from incomplete measurements

T Tong, C Ma, A Prater-Bennette, E Tripp… - Journal of Machine …, 2022 - jmlr.org
Tensors, which provide a powerful and flexible model for representing multi-attribute data
and multi-way interactions, play an indispensable role in modern data science across …

Model-based reinforcement learning is minimax-optimal for offline zero-sum markov games

Y Yan, G Li, Y Chen, J Fan - arXiv preprint arXiv:2206.04044, 2022 - arxiv.org
This paper makes progress towards learning Nash equilibria in two-player zero-sum Markov
games from offline data. Specifically, consider a $\gamma $-discounted infinite-horizon …

Extending relax-and-round combinatorial optimization solvers with quantum correlations

M Dupont, B Sundar - Physical Review A, 2024 - APS
We introduce a relax-and-round approach embedding the quantum approximate
optimization algorithm (QAOA) with p≥ 1 layers. We show for many problems, including …

Learning mixtures of linear dynamical systems

Y Chen, HV Poor - International conference on machine …, 2022 - proceedings.mlr.press
We study the problem of learning a mixture of multiple linear dynamical systems (LDSs) from
unlabeled short sample trajectories, each generated by one of the LDS models. Despite the …