Assessing deep learning: a work program for the humanities in the age of artificial intelligence

J Segessenmann, T Stadelmann, A Davison, O Dürr - AI and Ethics, 2023 - Springer
Following the success of deep learning (DL) in research, we are now witnessing the fast and
widespread adoption of artificial intelligence (AI) in daily life, influencing the way we act …

Linearized ADMM converges to second-order stationary points for non-convex problems

S Lu, JD Lee, M Razaviyayn… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In this work, a gradient-based primal-dual method of multipliers is proposed for solving a
class of linearly constrained non-convex problems. We show that with random initialization …

Guaranteed recovery of one-hidden-layer neural networks via cross entropy

H Fu, Y Chi, Y Liang - IEEE transactions on signal processing, 2020 - ieeexplore.ieee.org
We study model recovery for data classification, where the training labels are generated
from a one-hidden-layer neural network with sigmoid activations, also known as a single …

PA-GD: On the convergence of perturbed alternating gradient descent to second-order stationary points for structured nonconvex optimization

S Lu, M Hong, Z Wang - International Conference on …, 2019 - proceedings.mlr.press
Alternating gradient descent (A-GD) is a simple but popular algorithm in machine learning,
which updates two blocks of variables in an alternating manner using gradient descent …

Local geometry of cross entropy loss in learning one-hidden-layer neural networks

H Fu, Y Chi, Y Liang - 2019 IEEE International Symposium on …, 2019 - ieeexplore.ieee.org
We study model recovery for data classification, where the training labels are generated
from a one-hidden-layer neural network with sigmoid activations, and the goal is to recover …

Локальная кривизна и точность нейросетевой аппроксимации

ММ Краснов, ВС Смолин - XXI МЕЖДУНАРОДНАЯ НАУЧНО …, 2019 - elibrary.ru
Рассматривается задача повышения точности одномерной аппроксимации функций на
нейросети с одним скрытым слоем. В качестве примера целевой функции …