A review of off-policy evaluation in reinforcement learning

M Uehara, C Shi, N Kallus - arXiv preprint arXiv:2212.06355, 2022 - arxiv.org
Reinforcement learning (RL) is one of the most vibrant research frontiers in machine
learning and has been recently applied to solve a number of challenging problems. In this …

Score approximation, estimation and distribution recovery of diffusion models on low-dimensional data

M Chen, K Huang, T Zhao… - … Conference on Machine …, 2023 - proceedings.mlr.press
Diffusion models achieve state-of-the-art performance in various generation tasks. However,
their theoretical foundations fall far behind. This paper studies score approximation …

Diffusion models are minimax optimal distribution estimators

K Oko, S Akiyama, T Suzuki - International Conference on …, 2023 - proceedings.mlr.press
While efficient distribution learning is no doubt behind the groundbreaking success of
diffusion modeling, its theoretical guarantees are quite limited. In this paper, we provide the …

Personalized federated learning via variational bayesian inference

X Zhang, Y Li, W Li, K Guo… - … Conference on Machine …, 2022 - proceedings.mlr.press
Federated learning faces huge challenges from model overfitting due to the lack of data and
statistical diversity among clients. To address these challenges, this paper proposes a novel …

The modern mathematics of deep learning

J Berner, P Grohs, G Kutyniok… - arXiv preprint arXiv …, 2021 - cambridge.org
We describe the new field of the mathematical analysis of deep learning. This field emerged
around a list of research questions that were not answered within the classical framework of …

Reward-directed conditional diffusion: Provable distribution estimation and reward improvement

H Yuan, K Huang, C Ni, M Chen… - Advances in Neural …, 2024 - proceedings.neurips.cc
We explore the methodology and theory of reward-directed generation via conditional
diffusion models. Directed generation aims to generate samples with desired properties as …

Deep network approximation for smooth functions

J Lu, Z Shen, H Yang, S Zhang - SIAM Journal on Mathematical Analysis, 2021 - SIAM
This paper establishes the optimal approximation error characterization of deep rectified
linear unit (ReLU) networks for smooth functions in terms of both width and depth …

A survey on statistical theory of deep learning: Approximation, training dynamics, and generative models

N Suh, G Cheng - Annual Review of Statistics and Its Application, 2024 - annualreviews.org
In this article, we review the literature on statistical theories of neural networks from three
perspectives: approximation, training dynamics, and generative models. In the first part …

Deep MLP-CNN model using mixed-data to distinguish between COVID-19 and Non-COVID-19 patients

MM Ahsan, T E. Alam, T Trafalis, P Huebner - Symmetry, 2020 - mdpi.com
The limitations and high false-negative rates (30%) of COVID-19 test kits have been a
prominent challenge during the 2020 coronavirus pandemic. Manufacturing those kits and …

Nonparametric regression on low-dimensional manifolds using deep ReLU networks: Function approximation and statistical recovery

M Chen, H Jiang, W Liao, T Zhao - Information and Inference: A …, 2022 - academic.oup.com
Real-world data often exhibit low-dimensional geometric structures and can be viewed as
samples near a low-dimensional manifold. This paper studies nonparametric regression of …