Post-selection inference

AK Kuchibhotla, JE Kolassa… - Annual Review of …, 2022 - annualreviews.org
We discuss inference after data exploration, with a particular focus on inference after model
or variable selection. We review three popular approaches to this problem: sample splitting …

Splitting strategies for post-selection inference

DG Rasines, GA Young - Biometrika, 2023 - academic.oup.com
We consider the problem of providing valid inference for a selected parameter in a sparse
regression setting. It is well known that classical regression tools can be unreliable in this …

Debiased inference on treatment effect in a high-dimensional model

J Wang, X He, G Xu - Journal of the American Statistical …, 2020 - Taylor & Francis
This article concerns the potential bias in statistical inference on treatment effects when a
large number of covariates are present in a linear or partially linear model. While the …

Reducing the complexity of high-dimensional environmental data: An analytical framework using LASSO with considerations of confounding for statistical inference

S Frndak, G Yu, Y Oulhote, EI Queirolo, G Barg… - International journal of …, 2023 - Elsevier
Purpose Frameworks for selecting exposures in high-dimensional environmental datasets,
while considering confounding, are lacking. We present a two-step approach for exposure …

Empirical priors for prediction in sparse high-dimensional linear regression

R Martin, Y Tang - Journal of Machine Learning Research, 2020 - jmlr.org
In this paper we adopt the familiar sparse, high-dimensional linear regression model and
focus on the important but often overlooked task of prediction. In particular, we consider a …

Exact uniformly most powerful postselection confidence distributions

AC Garcia‐Angulo, G Claeskens - Scandinavian Journal of …, 2023 - Wiley Online Library
A conditioning on the event of having selected one model from a set of possibly misspecified
normal linear regression models leads to the construction of uniformly optimal conditional …

Understanding forms of childhood adversities and associations with adult health outcomes: a regression tree analysis

SP Perrins, E Vermes, K Cincotta, Y Xu… - Child Abuse & …, 2024 - Elsevier
Background Empirical studies have demonstrated associations between ten original
adverse childhood experiences (ACEs) and multiple health outcomes. Identifying expanded …

Assumption lean regression

R Berk, A Buja, L Brown, E George… - The American …, 2021 - Taylor & Francis
It is well known that with observational data, models used in conventional regression
analyses are commonly misspecified. Yet in practice, one tends to proceed with …

Neighborhood-based cross fitting approach to treatment effects with high-dimensional data

OD Agboola, H Yu - Computational Statistics & Data Analysis, 2023 - Elsevier
High-dimensional data are increasingly popular in various physical, biological and social
disciplines. A common existing approach of repeatedly splitting data was suggested to …

Quantifying Overfitting: Evaluating Neural Network Performance through Analysis of Null Space

H Rezaei, M Sabokrou - arXiv preprint arXiv:2305.19424, 2023 - arxiv.org
Machine learning models that are overfitted/overtrained are more vulnerable to knowledge
leakage, which poses a risk to privacy. Suppose we download or receive a model from a …