Spectral methods for data science: A statistical perspective

Y Chen, Y Chi, J Fan, C Ma - Foundations and Trends® in …, 2021 - nowpublishers.com
Spectral methods have emerged as a simple yet surprisingly effective approach for
extracting information from massive, noisy and incomplete data. In a nutshell, spectral …

Random Geometric Graph: Some recent developments and perspectives

Q Duchemin, Y De Castro - High Dimensional Probability IX: The Ethereal …, 2023 - Springer
Abstract The Random Geometric Graph (RGG) is a random graph model for network data
with an underlying spatial representation. Geometry endows RGGs with a rich dependence …

Bandits with many optimal arms

R De Heide, J Cheshire, P Ménard… - Advances in Neural …, 2021 - proceedings.neurips.cc
We consider a stochastic bandit problem with a possibly infinite number of arms. We write $
p^* $ for the proportion of optimal arms and $\Delta $ for the minimal mean-gap between …

A unified framework for spectral clustering in sparse graphs

L Dall'Amico, R Couillet, N Tremblay - Journal of Machine Learning …, 2021 - jmlr.org
This article considers spectral community detection in the regime of sparse networks with
heterogeneous degree distributions, for which we devise an algorithm to efficiently retrieve …

Uniform bounds for invariant subspace perturbations

A Damle, Y Sun - SIAM Journal on Matrix Analysis and Applications, 2020 - SIAM
For a fixed symmetric matrix A and symmetric perturbation E we develop purely deterministic
bounds on how invariant subspaces of A and A+E can differ when measured by a suitable …

Bayesian spiked Laplacian graphs

LL Duan, G Michailidis, M Ding - Journal of Machine Learning Research, 2023 - jmlr.org
In network analysis, it is common to work with a collection of graphs that exhibit
heterogeneity. For example, neuroimaging data from patient cohorts are increasingly …

Community Detection and Classification Guarantees Using Embeddings Learned by Node2Vec

A Davison, SC Morgan, OG Ward - arXiv preprint arXiv:2310.17712, 2023 - arxiv.org
Embedding the nodes of a large network into an Euclidean space is a common objective in
modern machine learning, with a variety of tools available. These embeddings can then be …

Computation of the sample Fréchet mean for sets of large graphs with applications to regression

D Ferguson, FG Meyer - Proceedings of the 2022 SIAM International …, 2022 - SIAM
To characterize the location (mean, median) of a set of graphs, one needs a notion of
centrality that is adapted to metric spaces, since graph sets are not Euclidean spaces. A …

On the validity of conformal prediction for network data under non-uniform sampling

R Lunde - arXiv preprint arXiv:2306.07252, 2023 - arxiv.org
We study the properties of conformal prediction for network data under various sampling
mechanisms that commonly arise in practice but often result in a non-representative sample …

Signed Diverse Multiplex Networks: Clustering and Inference

M Pensky - arXiv preprint arXiv:2402.10242, 2024 - arxiv.org
The paper introduces a Signed Generalized Random Dot Product Graph (SGRDPG) model,
which is a variant of the Generalized Random Dot Product Graph (GRDPG), where, in …