An investigation of how neural networks learn from the experiences of peers through periodic...

Deep neural network fusion via graph matching with applications to model ensemble and federated learning

C Liu, C Lou, R Wang, AY Xi… - … on Machine Learning, 2022 - proceedings.mlr.press

Abstract Model fusion without accessing training data in machine learning has attracted
increasing interest due to the practical resource-saving and data privacy issues. During the …

被引用次数：65 相关文章所有 4 个版本

[PDF] neurips.cc

Model fusion via optimal transport

SP Singh, M Jaggi - Advances in Neural Information …, 2020 - proceedings.neurips.cc

Combining different models is a widely used paradigm in machine learning applications.
While the most common approach is to form an ensemble of models and average their …

被引用次数：234 相关文章所有 10 个版本

[PDF] arxiv.org

Arcee's MergeKit: A Toolkit for Merging Large Language Models

C Goddard, S Siriwardhana, M Ehghaghi… - arXiv preprint arXiv …, 2024 - arxiv.org

The rapid expansion of the open-source language model landscape presents an opportunity
to merge the competencies of these model checkpoints by combining their parameters …

被引用次数：72 相关文章所有 3 个版本

[PDF] arxiv.org

Deep model fusion: A survey

W Li, Y Peng, M Zhang, L Ding, H Hu… - arXiv preprint arXiv …, 2023 - arxiv.org

Deep model fusion/merging is an emerging technique that merges the parameters or
predictions of multiple deep learning models into a single one. It combines the abilities of …

被引用次数：51 相关文章所有 2 个版本

[PDF] arxiv.org

Heterogeneous federated learning

F Yu, W Zhang, Z Qin, Z Xu, D Wang, C Liu… - arXiv preprint arXiv …, 2020 - arxiv.org

Federated learning learns from scattered data by fusing collaborative models from local
nodes. However, due to chaotic information distribution, the model fusion may suffer from …

被引用次数：43 相关文章所有 3 个版本

[PDF] arxiv.org

Lora soups: Merging loras for practical skill composition tasks

A Prabhakar, Y Li, K Narasimhan, S Kakade… - arXiv preprint arXiv …, 2024 - arxiv.org

Low-Rank Adaptation (LoRA) is a popular technique for parameter-efficient fine-tuning of
Large Language Models (LLMs). We study how different LoRA modules can be merged to …

被引用次数：2 相关文章所有 3 个版本

[PDF] arxiv.org

Non-iterative knowledge fusion in deep convolutional neural networks

MI Leontev, V Islenteva, SV Sukhov - Neural Processing Letters, 2020 - Springer

Incorporation of new knowledge into neural networks with simultaneous preservation of the
previous knowledge is known to be a nontrivial problem. This problem becomes even more …

被引用次数：23 相关文章所有 9 个版本

Neural-aware Decoupling Fusion based Personalized Federated Learning for Intelligent Sensing

Y Gao, L Shen, liang Liu, Z Cao, D Tao, H Ma… - ACM Transactions on …, 2024 - dl.acm.org

Personalized federated learning (PFL) is a framework that targets individual models for
optimization, providing better privacy and flexibility for clients. However, in challenging …

[PDF] arxiv.org

On cross-layer alignment for model fusion of heterogeneous neural networks

D Nguyen, T Nguyen, K Nguyen… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

OTFusion, or layer-wise model fusion via optimal transport, applies soft neuron association
to unify different pre-trained networks. Despite its effectiveness in saving computational …

被引用次数：9 相关文章所有 5 个版本

Model Fusion via Neuron Transplantation

M Öz, N Kiefer, C Debus, J Hörter, A Streit… - … European Conference on …, 2024 - Springer

Ensemble learning is a widespread technique to improve the prediction performance of
neural networks. However, it comes at the price of increased memory and inference time. In …

被引用次数：1 相关文章所有 4 个版本

高级搜索

QQ 群