A shrinkage principle for heavy-tailed data: High-dimensional robust low-rank matrix recovery

Q Sun, WX Zhou, J Fan - Journal of the American Statistical …, 2020 - Taylor & Francis

Big data can easily be contaminated by outliers or contain variables with heavy-tailed
distributions, which makes many conventional methods inadequate. To address this …

被引用次数：324 相关文章所有 15 个版本

[PDF] arxiv.org

Robust estimation via robust gradient estimation

A Prasad, AS Suggala, S Balakrishnan… - Journal of the Royal …, 2020 - academic.oup.com

We provide a new computationally efficient class of estimators for risk minimization. We
show that these estimators are robust for general statistical models, under varied robustness …

被引用次数：259 相关文章所有 10 个版本

[PDF] semanticscholar.org

[图书][B] Statistical foundations of data science

J Fan, R Li, CH Zhang, H Zou - 2020 - taylorfrancis.com

Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …

被引用次数：227 相关文章所有 9 个版本

[HTML] nih.gov

[HTML][HTML] Distributed estimation of principal eigenspaces

J Fan, D Wang, K Wang, Z Zhu - Annals of statistics, 2019 - ncbi.nlm.nih.gov

Principal component analysis (PCA) is fundamental to statistical machine learning. It extracts
latent principal factors that contribute to the most variation of the data. When data are stored …

被引用次数：212 相关文章所有 12 个版本

[PDF] arxiv.org

Recent developments in factor models and applications in econometric learning

J Fan, K Li, Y Liao - Annual Review of Financial Economics, 2021 - annualreviews.org

This article provides a selective overview of the recent developments in factor models and
their applications in econometric learning. We focus on the perspective of the low-rank …

被引用次数：28 相关文章所有 13 个版本

[PDF] acm.org

High dimensional differentially private stochastic optimization with heavy-tailed data

L Hu, S Ni, H Xiao, D Wang - Proceedings of the 41st ACM SIGMOD …, 2022 - dl.acm.org

As one of the most fundamental problems in machine learning, statistics and differential
privacy, Differentially Private Stochastic Convex Optimization (DP-SCO) has been …

被引用次数：51 相关文章所有 7 个版本

[PDF] projecteuclid.org

Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

S Minsker - The Annals of Statistics, 2018 - JSTOR

Estimation of the covariance matrix has attracted a lot of attention of the statistical research
community over the years, partially due to important applications such as principal …

被引用次数：111 相关文章所有 9 个版本

[PDF] sciencedirect.com

Generalized high-dimensional trace regression via nuclear norm regularization

J Fan, W Gong, Z Zhu - Journal of econometrics, 2019 - Elsevier

We study the generalized trace regression with a near low-rank regression coefficient matrix,
which extends notion of sparsity for regression coefficient vectors. Specifically, given a …

被引用次数：77 相关文章所有 12 个版本

[PDF] siam.org

ISLET: Fast and optimal low-rank tensor regression via importance sketching

AR Zhang, Y Luo, G Raskutti, M Yuan - SIAM journal on mathematics of data …, 2020 - SIAM

In this paper, we develop a novel procedure for low-rank tensor regression, namely
Importance Sketching Low-rank Estimation for Tensors (ISLET). The central idea behind …

被引用次数：61 相关文章所有 5 个版本

[PDF] arxiv.org

Principal component analysis for big data

J Fan, Q Sun, WX Zhou, Z Zhu - arXiv preprint arXiv:1801.01602, 2018 - arxiv.org

Big data is transforming our world, revolutionizing operations and analytics everywhere,
from financial engineering to biomedical sciences. The complexity of big data often makes …

被引用次数：77 相关文章所有 9 个版本

高级搜索

QQ 群