Adaptive huber regression

Q Sun, WX Zhou, J Fan - Journal of the American Statistical …, 2020 - Taylor & Francis
Big data can easily be contaminated by outliers or contain variables with heavy-tailed
distributions, which makes many conventional methods inadequate. To address this …

Robust estimation via robust gradient estimation

A Prasad, AS Suggala, S Balakrishnan… - Journal of the Royal …, 2020 - academic.oup.com
We provide a new computationally efficient class of estimators for risk minimization. We
show that these estimators are robust for general statistical models, under varied robustness …

[图书][B] Statistical foundations of data science

J Fan, R Li, CH Zhang, H Zou - 2020 - taylorfrancis.com
Statistical Foundations of Data Science gives a thorough introduction to commonly used
statistical models, contemporary statistical machine learning techniques and algorithms …

[HTML][HTML] Distributed estimation of principal eigenspaces

J Fan, D Wang, K Wang, Z Zhu - Annals of statistics, 2019 - ncbi.nlm.nih.gov
Principal component analysis (PCA) is fundamental to statistical machine learning. It extracts
latent principal factors that contribute to the most variation of the data. When data are stored …

Recent developments in factor models and applications in econometric learning

J Fan, K Li, Y Liao - Annual Review of Financial Economics, 2021 - annualreviews.org
This article provides a selective overview of the recent developments in factor models and
their applications in econometric learning. We focus on the perspective of the low-rank …

High dimensional differentially private stochastic optimization with heavy-tailed data

L Hu, S Ni, H Xiao, D Wang - Proceedings of the 41st ACM SIGMOD …, 2022 - dl.acm.org
As one of the most fundamental problems in machine learning, statistics and differential
privacy, Differentially Private Stochastic Convex Optimization (DP-SCO) has been …

Sub-Gaussian estimators of the mean of a random matrix with heavy-tailed entries

S Minsker - The Annals of Statistics, 2018 - JSTOR
Estimation of the covariance matrix has attracted a lot of attention of the statistical research
community over the years, partially due to important applications such as principal …

Generalized high-dimensional trace regression via nuclear norm regularization

J Fan, W Gong, Z Zhu - Journal of econometrics, 2019 - Elsevier
We study the generalized trace regression with a near low-rank regression coefficient matrix,
which extends notion of sparsity for regression coefficient vectors. Specifically, given a …

ISLET: Fast and optimal low-rank tensor regression via importance sketching

AR Zhang, Y Luo, G Raskutti, M Yuan - SIAM journal on mathematics of data …, 2020 - SIAM
In this paper, we develop a novel procedure for low-rank tensor regression, namely
Importance Sketching Low-rank Estimation for Tensors (ISLET). The central idea behind …

Principal component analysis for big data

J Fan, Q Sun, WX Zhou, Z Zhu - arXiv preprint arXiv:1801.01602, 2018 - arxiv.org
Big data is transforming our world, revolutionizing operations and analytics everywhere,
from financial engineering to biomedical sciences. The complexity of big data often makes …