Sketch-based empirical natural gradient methods for deep learning M Yang, D Xu, Z Wen, M Chen, P Xu Journal of Scientific Computing 92 (3), 94, 2022 | 23 | 2022 |
A stochastic extra-step quasi-Newton method for nonsmooth nonconvex optimization M Yang, A Milzarek, Z Wen, T Zhang Mathematical Programming, 1-47, 2022 | 22 | 2022 |
Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods M Yang, D Xu, C Hongyu, W Zaiwen, M Chen Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 10 | 2021 |
An efficient Fisher matrix approximation method for large-scale neural network optimization M Yang, D Xu, Q Cui, Z Wen, P Xu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (5), 5391-5403, 2022 | 8* | 2022 |
Riemannian natural gradient methods J Hu, R Ao, AMC So, M Yang, Z Wen SIAM Journal on Scientific Computing 46 (1), A204-A231, 2024 | 4 | 2024 |
Supplementary Material: Enhance Curvature Information by Structured Stochastic Quasi-Newton Methods M Yang, D Xu, H Chen, Z Wen, M Chen | | |