A survey on statistical theory of deep learning: Approximation, training dynamics, and generative models

N Suh, G Cheng - Annual Review of Statistics and Its Application, 2024 - annualreviews.org
In this article, we review the literature on statistical theories of neural networks from three
perspectives: approximation, training dynamics, and generative models. In the first part …

A universal law of robustness via isoperimetry

S Bubeck, M Sellke - Advances in Neural Information …, 2021 - proceedings.neurips.cc
Classically, data interpolation with a parametrized model class is possible as long as the
number of parameters is larger than the number of equations to be satisfied. A puzzling …

A universal law of robustness via isoperimetry

S Bubeck, M Sellke - Journal of the ACM, 2023 - dl.acm.org
Classically, data interpolation with a parametrized model class is possible as long as the
number of parameters is larger than the number of equations to be satisfied. A puzzling …

Gradient normalization for generative adversarial networks

YL Wu, HH Shuai, ZR Tam… - Proceedings of the IEEE …, 2021 - openaccess.thecvf.com
In this paper, we propose a novel normalization method called gradient normalization (GN)
to tackle the training instability of Generative Adversarial Networks (GANs) caused by the …

Error bounds of imitating policies and environments

T Xu, Z Li, Y Yu - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Imitation learning trains a policy by mimicking expert demonstrations. Various imitation
methods were proposed and empirically evaluated, meanwhile, their theoretical …

Improving neural language generation with spectrum control

L Wang, J Huang, K Huang, Z Hu… - International …, 2019 - openreview.net
Recent Transformer-based models such as Transformer-XL and BERT have achieved huge
success on various natural language processing tasks. However, contextualized …

Why spectral normalization stabilizes gans: Analysis and improvements

Z Lin, V Sekar, G Fanti - Advances in neural information …, 2021 - proceedings.neurips.cc
Spectral normalization (SN) is a widely-used technique for improving the stability and
sample quality of Generative Adversarial Networks (GANs). However, current understanding …

Stabilizing gans' training with brownian motion controller

T Luo, Z Zhu, J Chen, J Zhu - International Conference on …, 2023 - proceedings.mlr.press
The training process of generative adversarial networks (GANs) is unstable and does not
converge globally. In this paper, we examine the stability of GANs from the perspective of …

Inferential Wasserstein generative adversarial networks

Y Chen, Q Gao, X Wang - Journal of the Royal Statistical Society …, 2022 - academic.oup.com
Generative adversarial networks (GANs) have been impactful on many problems and
applications but suffer from unstable training. The Wasserstein GAN (WGAN) leverages the …

Sugan: A stable u-net based generative adversarial network

S Cheng, L Wang, M Zhang, C Zeng, Y Meng - Sensors, 2023 - mdpi.com
As one of the representative models in the field of image generation, generative adversarial
networks (GANs) face a significant challenge: how to make the best trade-off between the …