Distance-Based Classification with Lipschitz Functions.

M Geiger, L Petrini, M Wyart - Physics Reports, 2021 - Elsevier

Deep learning algorithms are responsible for a technological revolution in a variety of tasks
including image recognition or Go playing. Yet, why they work is not understood. Ultimately …

被引用次数：39 相关文章所有 6 个版本

[PDF] neurips.cc

Exploring generalization in deep learning

B Neyshabur, S Bhojanapalli… - Advances in neural …, 2017 - proceedings.neurips.cc

With a goal of understanding what drives generalization in deep networks, we consider
several recently suggested explanations, including norm-based control, sharpness and …

被引用次数：1365 相关文章所有 8 个版本

[PDF] neurips.cc

A closer look at accuracy vs. robustness

YY Yang, C Rashtchian, H Zhang… - Advances in neural …, 2020 - proceedings.neurips.cc

Current methods for training robust networks lead to a drop in test accuracy, which has led
prior works to posit that a robustness-accuracy tradeoff may be inevitable in deep learning …

被引用次数：303 相关文章所有 10 个版本

[PDF] neurips.cc

Lipschitz regularity of deep neural networks: analysis and efficient estimation

A Virmaux, K Scaman - Advances in Neural Information …, 2018 - proceedings.neurips.cc

Deep neural networks are notorious for being sensitive to small well-chosen perturbations,
and estimating the regularity of such architectures is of utmost importance for safe and …

被引用次数：534 相关文章所有 6 个版本

[PDF] jmlr.org

Breaking the curse of dimensionality with convex neural networks

F Bach - Journal of Machine Learning Research, 2017 - jmlr.org

We consider neural networks with a single hidden layer and non-decreasing positively
homogeneous activation functions like the rectified linear units. By letting the number of …

被引用次数：798 相关文章所有 13 个版本

[PDF] neurips.cc

Parallelized stochastic gradient descent

M Zinkevich, M Weimer, L Li… - Advances in neural …, 2010 - proceedings.neurips.cc

With the increase in available data parallel machine learning has become an increasingly
pressing problem. In this paper we present the first parallel stochastic gradient descent …

被引用次数：1750 相关文章所有 9 个版本

[PDF] ieee.org

Semi-Supervised Learning (Chapelle, O. et al., Eds.; 2006) [Book reviews]

O Chapelle, B Scholkopf, A Zien - IEEE Transactions on Neural …, 2009 - ieeexplore.ieee.org

This book addresses some theoretical aspects of semisupervised learning (SSL). The book
is organized as a collection of different contributions of authors who are experts on this topic …

被引用次数：7603 相关文章所有 18 个版本

[PDF] huji.ac.il

Fast and robust earth mover's distances

O Pele, M Werman - 2009 IEEE 12th international conference …, 2009 - ieeexplore.ieee.org

We present a new algorithm for a robust family of Earth Mover's Distances-EMDs with
thresholded ground distances. The algorithm transforms the flow-network of the EMD so that …

被引用次数：1126 相关文章所有 11 个版本

[PDF] projecteuclid.org

On the empirical estimation of integral probability metrics

BK Sriperumbudur, K Fukumizu, A Gretton, B Schölkopf… - 2012 - projecteuclid.org

Given two probability measures, P and Q defined on a measurable space, S, the integral
probability metric (IPM) is defined as F (P, Q)=\sup\left {\left | S f\, d PS f\, d Q\right |\,:\, f ∈ …

被引用次数：350 相关文章所有 14 个版本

[PDF] arxiv.org

Finite-sample guarantees for Wasserstein distributionally robust optimization: Breaking the curse of dimensionality

R Gao - Operations Research, 2023 - pubsonline.informs.org

Wasserstein distributionally robust optimization (DRO) aims to find robust and generalizable
solutions by hedging against data perturbations in Wasserstein distance. Despite its recent …

被引用次数：85 相关文章所有 8 个版本

高级搜索

QQ 群