- 学术资源搜索

Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction

D Stöger, M Soltanolkotabi - Advances in Neural …, 2021 - proceedings.neurips.cc

Recently there has been significant theoretical progress on understanding the convergence
and generalization of gradient-based methods on nonconvex losses with overparameterized …

被引用次数：70 相关文章所有 9 个版本

[PDF] neurips.cc

Breaking the sample size barrier in model-based reinforcement learning with a generative model

G Li, Y Wei, Y Chi, Y Gu… - Advances in neural …, 2020 - proceedings.neurips.cc

We investigate the sample efficiency of reinforcement learning in a $\gamma $-discounted
infinite-horizon Markov decision process (MDP) with state space S and action space A …

被引用次数：132 相关文章所有 10 个版本

[PDF] mlr.press

The power of preconditioning in overparameterized low-rank matrix sensing

X Xu, Y Shen, Y Chi, C Ma - International Conference on …, 2023 - proceedings.mlr.press

Abstract We propose $\textsf {ScaledGD ($\lambda $)} $, a preconditioned gradient descent
method to tackle the low-rank matrix sensing problem when the true rank is unknown, and …

被引用次数：22 相关文章所有 11 个版本

[PDF] pnas.org Full View

Approximate message passing from random initialization with applications to Z₂ synchronization

G Li, W Fan, Y Wei - … of the National Academy of Sciences, 2023 - National Acad Sciences

This paper is concerned with the problem of reconstructing an unknown rank-one matrix with
prior structural information from noisy observations. While computing the Bayes optimal …

被引用次数：16 相关文章所有 9 个版本

[PDF] arxiv.org

A theory of non-linear feature learning with one gradient step in two-layer neural networks

B Moniri, D Lee, H Hassani, E Dobriban - arXiv preprint arXiv:2310.07891, 2023 - arxiv.org

Feature learning is thought to be one of the fundamental reasons for the success of deep
neural networks. It is rigorously known that in two-layer fully-connected neural networks …

被引用次数：15 相关文章所有 5 个版本

[PDF] arxiv.org

A non-asymptotic framework for approximate message passing in spiked models

G Li, Y Wei - arXiv preprint arXiv:2208.03313, 2022 - arxiv.org

Approximate message passing (AMP) emerges as an effective iterative paradigm for solving
high-dimensional statistical problems. However, prior AMP theory--which focused mostly on …

被引用次数：27 相关文章所有 2 个版本

[HTML] nih.gov

[HTML][HTML] Bridging convex and nonconvex optimization in robust PCA: Noise, outliers, and missing data

Y Chen, J Fan, C Ma, Y Yan - Annals of statistics, 2021 - ncbi.nlm.nih.gov

This paper delivers improved theoretical guarantees for the convex programming approach
in low-rank matrix estimation, in the presence of (1) random noise,(2) gross sparse outliers …

被引用次数：65 相关文章所有 12 个版本

[PDF] neurips.cc

Spectral entry-wise matrix estimation for low-rank reinforcement learning

S Stojanovic, Y Jedra… - Advances in Neural …, 2023 - proceedings.neurips.cc

We study matrix estimation problems arising in reinforcement learning with low-rank
structure. In low-rank bandits, the matrix to be recovered specifies the expected arm …

被引用次数：3 相关文章所有 6 个版本

[PDF] jmlr.org

Scaling and scalability: Provable nonconvex low-rank tensor estimation from incomplete measurements

T Tong, C Ma, A Prater-Bennette, E Tripp… - Journal of Machine …, 2022 - jmlr.org

Tensors, which provide a powerful and flexible model for representing multi-attribute data
and multi-way interactions, play an indispensable role in modern data science across …

被引用次数：39 相关文章所有 7 个版本

[PDF] arxiv.org

Model-based reinforcement learning is minimax-optimal for offline zero-sum markov games

Y Yan, G Li, Y Chen, J Fan - arXiv preprint arXiv:2206.04044, 2022 - arxiv.org

This paper makes progress towards learning Nash equilibria in two-player zero-sum Markov
games from offline data. Specifically, consider a $\gamma $-discounted infinite-horizon …

被引用次数：25 相关文章所有 3 个版本

高级搜索

QQ 群

Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction

Breaking the sample size barrier in model-based reinforcement learning with a generative model

The power of preconditioning in overparameterized low-rank matrix sensing

Approximate message passing from random initialization with applications to Z₂ synchronization

A theory of non-linear feature learning with one gradient step in two-layer neural networks

A non-asymptotic framework for approximate message passing in spiked models

[HTML][HTML] Bridging convex and nonconvex optimization in robust PCA: Noise, outliers, and missing data

Spectral entry-wise matrix estimation for low-rank reinforcement learning

Scaling and scalability: Provable nonconvex low-rank tensor estimation from incomplete measurements

Model-based reinforcement learning is minimax-optimal for offline zero-sum markov games

引用