Spectral methods for data science: A statistical perspective

Y Chen, Y Chi, J Fan, C Ma - Foundations and Trends® in …, 2021 - nowpublishers.com
Spectral methods have emerged as a simple yet surprisingly effective approach for
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …

Tensor SVD: Statistical and computational limits

A Zhang, D Xia - IEEE Transactions on Information Theory, 2018 - ieeexplore.ieee.org
In this paper, we propose a general framework for tensor singular value decomposition
(tensor singular value decomposition (SVD)), which focuses on the methodology and theory …

Theoretical foundations of t-sne for visualizing high-dimensional clustered data

TT Cai, R Ma - Journal of Machine Learning Research, 2022 - jmlr.org
This paper investigates the theoretical foundations of the t-distributed stochastic neighbor
embedding (t-SNE) algorithm, a popular nonlinear dimension reduction and data …

Approximate message passing from random initialization with applications to Z2 synchronization

G Li, W Fan, Y Wei - … of the National Academy of Sciences, 2023 - National Acad Sciences
This paper is concerned with the problem of reconstructing an unknown rank-one matrix with
prior structural information from noisy observations. While computing the Bayes optimal …

An Eigenvector Perturbation Bound and Its Application

J Fan, W Wang, Y Zhong - Journal of Machine Learning Research, 2018 - jmlr.org
In statistics and machine learning, we are interested in the eigenvectors (or singular vectors)
of certain matrices (eg covariance matrices, data matrices, etc). However, those matrices are …

The two-to-infinity norm and singular subspace geometry with applications to high-dimensional statistics

J Cape, M Tang, CE Priebe - 2019 - projecteuclid.org
The singular value matrix decomposition plays a ubiquitous role throughout statistics and
related fields. Myriad applications including clustering, classification, and dimensionality …

An optimal statistical and computational framework for generalized tensor estimation

R Han, R Willett, AR Zhang - The Annals of Statistics, 2022 - projecteuclid.org
An optimal statistical and computational framework for generalized tensor estimation Page 1 The
Annals of Statistics 2022, Vol. 50, No. 1, 1–29 https://doi.org/10.1214/21-AOS2061 © Institute of …

Hamiltonian tomography via quantum quench

Z Li, L Zou, TH Hsieh - Physical review letters, 2020 - APS
We show that it is possible to uniquely reconstruct a generic many-body local Hamiltonian
from a single pair of generic initial and final states related by evolving with the Hamiltonian …

An theory of PCA and spectral clustering

E Abbe, J Fan, K Wang - The Annals of Statistics, 2022 - projecteuclid.org
An lp theory of PCA and spectral clustering Page 1 The Annals of Statistics 2022, Vol. 50, No.
4, 2359–2385 https://doi.org/10.1214/22-AOS2196 © Institute of Mathematical Statistics, 2022 …

Quantifying common and distinct information in single-cell multimodal data with Tilted Canonical Correlation Analysis

KZ Lin, NR Zhang - … of the National Academy of Sciences, 2023 - National Acad Sciences
Multimodal single-cell technologies profile multiple modalities for each cell simultaneously,
enabling a more thorough characterization of cell populations. Existing dimension-reduction …