Towards Hyperparameter-Agnostic DNN Training via Dynamical System Insights

C Fiscko, A Agarwal, Y Ruan, S Kar, L Pileggi… - arXiv preprint arXiv …, 2023 - arxiv.org
We present a stochastic first-order optimization method specialized for deep neural networks
(DNNs), ECCO-DNN. This method models the optimization variable trajectory as a …

Lookahead optimizer: k steps forward, 1 step back

M Zhang, J Lucas, J Ba… - Advances in neural …, 2019 - proceedings.neurips.cc
The vast majority of successful deep neural networks are trained using variants of stochastic
gradient descent (SGD) algorithms. Recent attempts to improve SGD can be broadly …
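
The "k steps forward, 1 step back" in the title is the whole algorithm: an inner optimizer takes k fast steps, after which the slow weights are pulled part of the way toward where the fast weights ended up. A minimal NumPy sketch of that two-loop structure, assuming plain SGD as the inner optimizer and illustrative values for k and the interpolation factor alpha (not the paper's tuned configuration):

```python
import numpy as np

def lookahead_sgd(grad_fn, w0, k=5, alpha=0.5, inner_lr=0.1, outer_steps=100):
    """Lookahead-style loop: k fast SGD steps, then one slow interpolation step."""
    slow = np.asarray(w0, dtype=float)
    for _ in range(outer_steps):
        fast = slow.copy()
        for _ in range(k):                 # k steps forward with the inner optimizer
            fast -= inner_lr * grad_fn(fast)
        slow += alpha * (fast - slow)      # 1 step back: interpolate toward the fast weights
    return slow

# Toy usage: minimize f(w) = 0.5 * ||w||^2, whose gradient is w.
print(lookahead_sgd(lambda w: w, w0=np.ones(3)))  # ends up close to the minimizer at the origin
```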

Training aware sigmoidal optimizer

D Macêdo, P Dreyer, T Ludermir… - arXiv preprint arXiv …, 2021 - arxiv.org
Proper optimization of deep neural networks is an open research question since an optimal
procedure to change the learning rate throughout training is still unknown. Manually defining …
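
The snippet's point is that hand-defined learning-rate changes are hard to get right; a sigmoid-shaped decay tied to training progress is one automatic alternative. A hedged sketch of such a schedule, where the initial/final rates and the steepness constant are illustrative assumptions rather than the paper's settings:

```python
import math

def sigmoidal_lr(step, total_steps, lr_max=0.1, lr_min=1e-3, steepness=10.0):
    """Sigmoid-shaped decay: stays near lr_max early in training, drops toward lr_min late."""
    progress = step / total_steps                               # fraction of training completed
    gate = 1.0 / (1.0 + math.exp(steepness * (progress - 0.5)))
    return lr_min + (lr_max - lr_min) * gate

# Example: the schedule at a few points of a 1000-step run.
for s in (0, 250, 500, 750, 1000):
    print(s, round(sigmoidal_lr(s, 1000), 5))
```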

AdaFisher: Adaptive Second Order Optimization via Fisher Information

DM Gomes, Y Zhang, E Belilovsky, G Wolf… - arXiv preprint arXiv …, 2024 - arxiv.org
First-order optimization methods are currently the mainstream in training deep neural
networks (DNNs). Optimizers like Adam incorporate limited curvature information by …
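
One standard way to "incorporate limited curvature information" is to precondition the gradient with a diagonal estimate of the empirical Fisher, which can be approximated by a running average of squared gradients. AdaFisher itself uses a richer Fisher approximation, so the sketch below shows only this generic diagonal-Fisher idea, with assumed constants:

```python
import numpy as np

def diag_fisher_step(w, grad, fisher, lr=0.01, beta=0.95, eps=1e-8):
    """Precondition the gradient by a running diagonal empirical-Fisher estimate of E[g^2]."""
    fisher = beta * fisher + (1.0 - beta) * grad**2   # running squared-gradient (Fisher) estimate
    w = w - lr * grad / (np.sqrt(fisher) + eps)       # scaled, natural-gradient-like step
    return w, fisher

# Toy usage: minimize 0.5 * ||w||^2, whose gradient is w.
w, fisher = np.ones(3), np.zeros(3)
for _ in range(200):
    w, fisher = diag_fisher_step(w, w, fisher)
print(w)
```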

SANIA: Polyak-type optimization framework leads to scale invariant stochastic algorithms

F Abdukhakimov, C Xiang, D Kamzolov… - arXiv preprint arXiv …, 2023 - arxiv.org
Adaptive optimization methods are widely recognized as among the most popular
approaches for training Deep Neural Networks (DNNs). Techniques such as Adam …
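
For context on the "Polyak-type" label: the classical Polyak step size divides the current suboptimality gap by the squared gradient norm, so the step scales with the objective instead of a hand-tuned learning rate. A minimal deterministic sketch, assuming the optimal value f* is known (the paper's stochastic, scale-invariant variants go beyond this):

```python
import numpy as np

def polyak_gd(f, grad_f, w0, f_star=0.0, steps=50, eps=1e-12):
    """Gradient descent with the classical Polyak step size (f(w) - f*) / ||grad f(w)||^2."""
    w = np.asarray(w0, dtype=float)
    for _ in range(steps):
        g = grad_f(w)
        step = (f(w) - f_star) / (np.dot(g, g) + eps)   # Polyak step size
        w -= step * g
    return w

# Toy usage: f(w) = 0.5 * ||w||^2 has f* = 0 and gradient w.
print(polyak_gd(lambda w: 0.5 * np.dot(w, w), lambda w: w, np.ones(3)))
```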

Understanding Stochastic Optimization Behavior at the Layer Update Level (Student Abstract)

J Zhang, GX Qiao, A Lopotenco, IT Pan - Proceedings of the AAAI …, 2022 - ojs.aaai.org
Popular first-order stochastic optimization methods for deep neural networks (DNNs) are
usually either accelerated schemes (e.g., stochastic gradient descent (SGD) with momentum) …

PID controller-based stochastic optimization acceleration for deep neural networks

H Wang, Y Luo, W An, Q Sun, J Xu… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Deep neural networks (DNNs) are widely used and have demonstrated their power in many
applications, such as computer vision and pattern recognition. However, the training of these …
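
The PID view treats SGD with momentum as a PI controller (a proportional term on the current gradient plus an integral of past gradients) and adds a derivative term on the gradient's change to damp oscillation. A hedged sketch of a generic PID-style update, with assumed gains kp, ki, kd rather than the coefficients derived in the paper:

```python
import numpy as np

def pid_step(w, grad, state, lr=0.1, kp=1.0, ki=0.9, kd=0.1):
    """One PID-style step: P = current gradient, I = accumulated gradients, D = gradient change."""
    integral = state.get("integral", np.zeros_like(w)) + grad
    derivative = grad - state.get("prev_grad", np.zeros_like(w))
    state["integral"], state["prev_grad"] = integral, grad
    return w - lr * (kp * grad + ki * integral + kd * derivative), state

# Toy usage: minimize 0.5 * ||w||^2, whose gradient is w.
w, state = np.ones(3), {}
for _ in range(100):
    w, state = pid_step(w, w, state)
print(np.linalg.norm(w))  # the iterate is driven toward the minimizer at the origin
```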

AdaPID: An adaptive PID optimizer for training deep neural networks

B Weng, J Sun, A Sadeghi… - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
Deep neural networks (DNNs) have well-documented merits in learning nonlinear functions
in high-dimensional spaces. Stochastic gradient descent (SGD)-type optimization algorithms …

DDPNOpt: Differential dynamic programming neural optimizer

GH Liu, T Chen, EA Theodorou - arXiv preprint arXiv:2002.08809, 2020 - arxiv.org
Interpretation of deep neural network (DNN) training as an optimal control problem with
nonlinear dynamical systems has received considerable attention recently, yet the …

DeepOBS: A deep learning optimizer benchmark suite

F Schneider, L Balles, P Hennig - arXiv preprint arXiv:1903.05499, 2019 - arxiv.org
Because the choice and tuning of the optimizer affect the speed, and ultimately the
performance, of deep learning, there is significant past and recent research in this area. Yet …