A method for finding similarity between multi-layer perceptrons by forward bipartite alignment

A Navon, A Shamsian, I Achituve… - International …, 2023 - proceedings.mlr.press

Designing machine learning architectures for processing neural networks in their raw weight
matrix form is a newly introduced research direction. Unfortunately, the unique symmetry …

被引用次数：58 相关文章所有 6 个版本

[PDF] arxiv.org

Zipit! merging models from different tasks without training

G Stoica, D Bolya, J Bjorner, P Ramesh… - arXiv preprint arXiv …, 2023 - arxiv.org

Typical deep visual recognition models are capable of performing the one task they were
trained on. In this paper, we tackle the extremely difficult problem of combining completely …

被引用次数：76 相关文章所有 5 个版本

[PDF] arxiv.org

Repair: Renormalizing permuted activations for interpolation repair

K Jordan, H Sedghi, O Saukh, R Entezari… - arXiv preprint arXiv …, 2022 - arxiv.org

In this paper we look into the conjecture of Entezari et al.(2021) which states that if the
permutation invariance of neural networks is taken into account, then there is likely no loss …

被引用次数：74 相关文章所有 7 个版本

[PDF] neurips.cc

Going beyond linear mode connectivity: The layerwise linear feature connectivity

Z Zhou, Y Yang, X Yang, J Yan… - Advances in Neural …, 2023 - proceedings.neurips.cc

Recent work has revealed many intriguing empirical phenomena in neural network training,
despite the poorly understood and highly complex loss landscapes and training dynamics …

被引用次数：19 相关文章所有 5 个版本

Understanding and mitigating dimensional collapse in federated learning

Y Shi, J Liang, W Zhang, C Xue… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Federated learning aims to train models collaboratively across different clients without
sharing data for privacy considerations. However, one major challenge for this learning …

被引用次数：10 相关文章所有 3 个版本

[PDF] arxiv.org

Training-free model merging for multi-target domain adaptation

W Li, H Gao, M Gao, B Tian, R Zhi, H Zhao - European Conference on …, 2025 - Springer

In this paper, we study multi-target domain adaptation of scene understanding models.
While previous methods achieved commendable results through inter-domain consistency …

被引用次数：3 相关文章所有 6 个版本

[PDF] arxiv.org

The empirical impact of neural parameter symmetries, or lack thereof

D Lim, TM Putterman, R Walters, H Maron… - arXiv preprint arXiv …, 2024 - arxiv.org

Many algorithms and observed phenomena in deep learning appear to be affected by
parameter symmetries--transformations of neural network parameters that do not change the …

被引用次数：3 相关文章所有 3 个版本

[PDF] ijesty.org

Artificial Neural Network Model For Wind Mill

Z Azmi - International Journal of Engineering, Science and …, 2021 - ijesty.org

Utilization of wind energy sources provides advantages in terms of being environmentally
friendly, and it can be energy source is realible. The analysis of wind mill control using …

被引用次数：18 相关文章所有 6 个版本

[PDF] sciencedirect.com

Aligned deep neural network for integrative analysis with high-dimensional input

S Zhang, S Zhang, H Yi, S Ma - Journal of Biomedical Informatics, 2023 - Elsevier

Objective: Deep neural network (DNN) techniques have demonstrated significant
advantages over regression and some other techniques. In recent studies, DNN-based …

被引用次数：1 相关文章所有 4 个版本

[PDF] iospress.nl

Weight scope alignment: A frustratingly easy method for model merging

Y Xu, XC Li, L Gan, DC Zhan - ECAI 2024, 2024 - ebooks.iospress.nl

Merging models becomes a fundamental procedure in some applications that consider
model efficiency and robustness. The training randomness or Non-IID data poses a huge …

被引用次数：1 相关文章所有 5 个版本

高级搜索

QQ 群