Feature selection: A data perspective

J Li, K Cheng, S Wang, F Morstatter… - ACM computing …, 2017 - dl.acm.org
Feature selection, as a data preprocessing strategy, has been proven to be effective and
efficient in preparing data (especially high-dimensional data) for various data-mining and …

The little engine that could: Regularization by denoising (RED)

Y Romano, M Elad, P Milanfar - SIAM Journal on Imaging Sciences, 2017 - SIAM
Removal of noise from an image is an extensively studied problem in image processing.
Indeed, the recent advent of sophisticated and highly effective denoising algorithms has led …

Extended comparisons of best subset selection, forward stepwise selection, and the lasso

T Hastie, R Tibshirani, RJ Tibshirani - arXiv preprint arXiv:1707.08692, 2017 - arxiv.org
In exciting new work, Bertsimas et al.(2016) showed that the classical best subset selection
problem in regression modeling can be formulated as a mixed integer optimization (MIO) …

Program evaluation and causal inference with high‐dimensional data

A Belloni, V Chernozhukov, I Fernandez‐Val… - …, 2017 - Wiley Online Library
In this paper, we provide efficient estimators and honest confidence bands for a variety of
treatment effects including local average (LATE) and local quantile treatment effects (LQTE) …

Convolutional neural networks analyzed via convolutional sparse coding

V Papyan, Y Romano, M Elad - Journal of Machine Learning Research, 2017 - jmlr.org
Convolutional neural networks (CNN) have led to many state-of-the-art results spanning
through various fields. However, a clear and profound theoretical understanding of the …

A general theory of hypothesis tests and confidence regions for sparse high dimensional models

Y Ning, H Liu - 2017 - projecteuclid.org
A general theory of hypothesis tests and confidence regions for sparse high dimensional
models Page 1 The Annals of Statistics 2017, Vol. 45, No. 1, 158–195 DOI: 10.1214/16-AOS1448 …

Learning the structure of generative models without labeled data

SH Bach, B He, A Ratner, C Ré - … Conference on Machine …, 2017 - proceedings.mlr.press
Curating labeled training data has become the primary bottleneck in machine learning.
Recent frameworks address this bottleneck with generative models to synthesize labels at …

Confidence intervals for high-dimensional linear regression: Minimax rates and adaptivity

TT Cai, Z Guo - 2017 - projecteuclid.org
Supplement to “Confidence intervals for high-dimensional linear regression: Minimax rates
and adaptivity”. Detailed proofs of the adaptivity lower bound and minimax upper bound for …

Instrumental variable quantile regression

V Chernozhukov, C Hansen… - Handbook of quantile …, 2017 - taylorfrancis.com
This chapter reviews the instrumental variable quantile regression model of Chernozhukov
and Hansen. It discusses the key conditions used for identification of structural quantile …

Support recovery without incoherence: A case for nonconvex regularization

PL Loh, MJ Wainwright - 2017 - projecteuclid.org
Support recovery without incoherence: A case for nonconvex regularization Page 1 The Annals
of Statistics 2017, Vol. 45, No. 6, 2455–2482 https://doi.org/10.1214/16-AOS1530 © Institute …