Generalized equivalences between subsampling and ridge regularization

P Patil, JH Du - Advances in Neural Information Processing …, 2024 - proceedings.neurips.cc
We establish precise structural and risk equivalences between subsampling and ridge
regularization for ensemble ridge estimators. Specifically, we prove that linear and quadratic …

Subsample ridge ensembles: Equivalences and generalized cross-validation

JH Du, P Patil, AK Kuchibhotla - arXiv preprint arXiv:2304.13016, 2023 - arxiv.org
We study subsampling-based ridge ensembles in the proportional asymptotics regime,
where the feature size grows proportionally with the sample size such that their ratio …

Analysis of bootstrap and subsampling in high-dimensional regularized regression

L Clarté, A Vandenbroucque, G Dalle… - arXiv preprint arXiv …, 2024 - arxiv.org
We investigate popular resampling methods for estimating the uncertainty of statistical
models, such as subsampling, bootstrap and the jackknife, and their performance in high …

Corrected generalized cross-validation for finite ensembles of penalized estimators

PC Bellec, JH Du, T Koriyama, P Patil… - Journal of the Royal …, 2024 - academic.oup.com
Generalized cross-validation (GCV) is a widely used method for estimating the squared out-
of-sample prediction risk that employs scalar degrees of freedom adjustment (in a …

Bagging provides assumption-free stability

JA Soloff, RF Barber, R Willett - Journal of Machine Learning Research, 2024 - jmlr.org
Bagging is an important technique for stabilizing machine learning models. In this paper, we
derive a finite-sample guarantee on the stability of bagging for any model. Our result places …

Asymptotically free sketched ridge ensembles: Risks, cross-validation, and tuning

P Patil, D LeJeune - arXiv preprint arXiv:2310.04357, 2023 - arxiv.org
We employ random matrix theory to establish consistency of generalized cross validation
(GCV) for estimating prediction risks of sketched ridge regression ensembles, enabling …

Precise asymptotics of bagging regularized m-estimators

T Koriyama, P Patil, JH Du, K Tan, PC Bellec - arXiv preprint arXiv …, 2024 - arxiv.org
We characterize the squared prediction risk of ensemble estimators obtained through
subagging (subsample bootstrap aggregating) regularized M-estimators and construct a …

Optimal Ridge Regularization for Out-of-Distribution Prediction

P Patil, JH Du, RJ Tibshirani - arXiv preprint arXiv:2404.01233, 2024 - arxiv.org
We study the behavior of optimal ridge regularization and optimal ridge risk for out-of-
distribution prediction, where the test distribution deviates arbitrarily from the train …

Extrapolated cross-validation for randomized ensembles

JH Du, P Patil, K Roeder… - Journal of Computational …, 2024 - Taylor & Francis
Ensemble methods such as bagging and random forests are ubiquitous in various fields,
from finance to genomics. Despite their prevalence, the question of the efficient tuning of …

Implicit regularization paths of weighted neural representations

JH Du, P Patil - arXiv preprint arXiv:2408.15784, 2024 - arxiv.org
We study the implicit regularization effects induced by (observation) weighting of pretrained
features. For weight and feature matrices of bounded operator norms that are infinitesimally …