We prove a general Embedding Principle of loss landscape of deep neural networks (NNs) that unravels a hierarchical structure of the loss landscape of NNs, ie, loss landscape of an …
M Chen, R Niu, W Zheng - Nonlinear Dynamics, 2023 - Springer
In this paper, an adaptive multi-scale neural network with Resnet blocks (adaptive-MS- Resnet) architecture is constructed for solving the Poisson equation, Helmholtz equation …
Z Zhang, ZQJ Xu - IEEE Transactions on Pattern Analysis and …, 2024 - ieeexplore.ieee.org
It is important to understand how dropout, a popular regularization method, aids in achieving a good generalization solution during neural network training. In this work, we present a …
α-clustering structure is a significant topic in light nuclei. A Bayesian convolutional neural network (BCNN) is applied to classify initial nonclustered and clustered configurations …
XA Li, ZQJ Xu, L Zhang - Journal of Computational Physics, 2023 - Elsevier
While deep learning algorithms demonstrate a great potential in scientific computing, its application to multi-scale problems remains to be a big challenge. This is manifested by the …
X Li, ZQJ Xu, Z Zhang - arXiv preprint arXiv:2305.12133, 2023 - arxiv.org
In this work, we investigate the mechanism underlying loss spikes observed during neural network training. When the training enters a region with a lower-loss-as-sharper (LLAS) …
Z Zhang, Z Wang, J Yao, Z Zhou, X Li, ZQJ Xu - arXiv preprint arXiv …, 2024 - arxiv.org
Understanding transformer-based language models is becoming increasingly crucial, particularly as they play pivotal roles in advancing towards artificial general intelligence …
Generative adversarial networks (GANs) are a class of machine-learning models that use adversarial training to generate new samples with the same (potentially very complex) …
Y Peng, D Hu, ZQJ Xu - Journal of Computational Physics, 2023 - Elsevier
Deep learning has achieved wide success in solving Partial Differential Equations (PDEs), with particular strength in handling high dimensional problems and parametric problems …