Equivariant architectures for learning in deep weight spaces

A Navon, A Shamsian, I Achituve… - International …, 2023 - proceedings.mlr.press
Designing machine learning architectures for processing neural networks in their raw weight
matrix form is a newly introduced research direction. Unfortunately, the unique symmetry …

Zipit! merging models from different tasks without training

G Stoica, D Bolya, J Bjorner, P Ramesh… - arXiv preprint arXiv …, 2023 - arxiv.org
Typical deep visual recognition models are capable of performing the one task they were
trained on. In this paper, we tackle the extremely difficult problem of combining completely …

Repair: Renormalizing permuted activations for interpolation repair

K Jordan, H Sedghi, O Saukh, R Entezari… - arXiv preprint arXiv …, 2022 - arxiv.org
In this paper we look into the conjecture of Entezari et al.(2021) which states that if the
permutation invariance of neural networks is taken into account, then there is likely no loss …

Going beyond linear mode connectivity: The layerwise linear feature connectivity

Z Zhou, Y Yang, X Yang, J Yan… - Advances in Neural …, 2023 - proceedings.neurips.cc
Recent work has revealed many intriguing empirical phenomena in neural network training,
despite the poorly understood and highly complex loss landscapes and training dynamics …

Understanding and mitigating dimensional collapse in federated learning

Y Shi, J Liang, W Zhang, C Xue… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Federated learning aims to train models collaboratively across different clients without
sharing data for privacy considerations. However, one major challenge for this learning …

Training-free model merging for multi-target domain adaptation

W Li, H Gao, M Gao, B Tian, R Zhi, H Zhao - European Conference on …, 2025 - Springer
In this paper, we study multi-target domain adaptation of scene understanding models.
While previous methods achieved commendable results through inter-domain consistency …

The empirical impact of neural parameter symmetries, or lack thereof

D Lim, TM Putterman, R Walters, H Maron… - arXiv preprint arXiv …, 2024 - arxiv.org
Many algorithms and observed phenomena in deep learning appear to be affected by
parameter symmetries--transformations of neural network parameters that do not change the …

Artificial Neural Network Model For Wind Mill

Z Azmi - International Journal of Engineering, Science and …, 2021 - ijesty.org
Utilization of wind energy sources provides advantages in terms of being environmentally
friendly, and it can be energy source is realible. The analysis of wind mill control using …

Aligned deep neural network for integrative analysis with high-dimensional input

S Zhang, S Zhang, H Yi, S Ma - Journal of Biomedical Informatics, 2023 - Elsevier
Objective: Deep neural network (DNN) techniques have demonstrated significant
advantages over regression and some other techniques. In recent studies, DNN-based …

Weight scope alignment: A frustratingly easy method for model merging

Y Xu, XC Li, L Gan, DC Zhan - ECAI 2024, 2024 - ebooks.iospress.nl
Merging models becomes a fundamental procedure in some applications that consider
model efficiency and robustness. The training randomness or Non-IID data poses a huge …