Projection‐based techniques for high‐dimensional optimal transport problems

J Zhang, P Ma, W Zhong, C Meng - Wiley Interdisciplinary …, 2023 - Wiley Online Library
Optimal transport (OT) methods seek a transformation map (or plan) between two probability
measures, such that the transformation has the minimum transportation cost. Such a …

Model-free conditional feature screening with FDR control

Z Tong, Z Cai, S Yang, R Li - Journal of the American Statistical …, 2023 - Taylor & Francis
In this article, we propose a model-free conditional feature screening method with false
discovery rate (FDR) control for ultra-high dimensional data. The proposed method is built …

A generic model-free feature screening procedure for ultra-high dimensional data with categorical response

X Cheng, H Wang - Computer Methods and Programs in Biomedicine, 2023 - Elsevier
Background and objective: Identifying active features from ultra-high dimensional data is
one of the primary and vital tasks in statistical learning and biological discovery. Methods: In …

A Model‐Free Feature Selection Technique of Feature Screening and Random Forest‐Based Recursive Feature Elimination

S Xia, Y Yang - International Journal of Intelligent Systems, 2023 - Wiley Online Library
This paper studies data with mass features, commonly observed in applications such as text
classification and medical diagnosis. We allow data to have several structures without …

FDR control and power analysis for high-dimensional logistic regression via StabKoff

P Yuan, Y Kong, G Li - Statistical Papers, 2024 - Springer
Identifying significant variables for the high-dimensional logistic regression model is a
fundamental problem in modern statistics and machine learning. This paper introduces a …

A tradeoff between false discovery and true positive proportions for sparse high-dimensional logistic regression

J Zhou, G Claeskens - Electronic Journal of Statistics, 2024 - projecteuclid.org
The logistic regression model is a simple and classic approach to binary classification,
where in sparse high-dimensional settings, one believes that only a small proportion of the …

A transparent and nonlinear method for variable selection

K Wang, H Wang, J Zhao, L Wang - Expert Systems with Applications, 2024 - Elsevier
Variable selection is a procedure to obtain truly important predictors from inputs. Complex
nonlinear dependencies and strong coupling pose great challenges for variable selection in …

Scalable model-free feature screening via sliced-wasserstein dependency

T Li, J Yu, C Meng - Journal of Computational and Graphical …, 2023 - Taylor & Francis
We consider the model-free feature screening problem that aims to discard non-informative
features before downstream analysis. Most of the existing feature screening approaches …

High‐dimensional feature screening for nonlinear associations with survival outcome using restricted mean survival time

Y Chen, K Fai Lam, Z Liu - Stat, 2024 - Wiley Online Library
Feature screening is an important tool in analysing ultrahigh‐dimensional data, particularly
in the field of Omics and oncology studies. However, most attention has been focused on …

Reproducible learning in large-scale graphical models

J Zhou, Y Li, Z Zheng, D Li - Journal of multivariate analysis, 2022 - Elsevier
Learning the conditional dependence structures through high-dimensional graphical models
is of fundamental importance in many contemporary applications. Despite the fast growing …