Deep neural network fusion via graph matching with applications to model ensemble and federated learning

C Liu, C Lou, R Wang, AY Xi… - … on Machine Learning, 2022 - proceedings.mlr.press
Abstract Model fusion without accessing training data in machine learning has attracted
increasing interest due to the practical resource-saving and data privacy issues. During the …

Model fusion via optimal transport

SP Singh, M Jaggi - Advances in Neural Information …, 2020 - proceedings.neurips.cc
Combining different models is a widely used paradigm in machine learning applications.
While the most common approach is to form an ensemble of models and average their …

Arcee's MergeKit: A Toolkit for Merging Large Language Models

C Goddard, S Siriwardhana, M Ehghaghi… - arXiv preprint arXiv …, 2024 - arxiv.org
The rapid expansion of the open-source language model landscape presents an opportunity
to merge the competencies of these model checkpoints by combining their parameters …

Deep model fusion: A survey

W Li, Y Peng, M Zhang, L Ding, H Hu… - arXiv preprint arXiv …, 2023 - arxiv.org
Deep model fusion/merging is an emerging technique that merges the parameters or
predictions of multiple deep learning models into a single one. It combines the abilities of …

Heterogeneous federated learning

F Yu, W Zhang, Z Qin, Z Xu, D Wang, C Liu… - arXiv preprint arXiv …, 2020 - arxiv.org
Federated learning learns from scattered data by fusing collaborative models from local
nodes. However, due to chaotic information distribution, the model fusion may suffer from …

Lora soups: Merging loras for practical skill composition tasks

A Prabhakar, Y Li, K Narasimhan, S Kakade… - arXiv preprint arXiv …, 2024 - arxiv.org
Low-Rank Adaptation (LoRA) is a popular technique for parameter-efficient fine-tuning of
Large Language Models (LLMs). We study how different LoRA modules can be merged to …

Non-iterative knowledge fusion in deep convolutional neural networks

MI Leontev, V Islenteva, SV Sukhov - Neural Processing Letters, 2020 - Springer
Incorporation of new knowledge into neural networks with simultaneous preservation of the
previous knowledge is known to be a nontrivial problem. This problem becomes even more …

Neural-aware Decoupling Fusion based Personalized Federated Learning for Intelligent Sensing

Y Gao, L Shen, liang Liu, Z Cao, D Tao, H Ma… - ACM Transactions on …, 2024 - dl.acm.org
Personalized federated learning (PFL) is a framework that targets individual models for
optimization, providing better privacy and flexibility for clients. However, in challenging …

On cross-layer alignment for model fusion of heterogeneous neural networks

D Nguyen, T Nguyen, K Nguyen… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org
OTFusion, or layer-wise model fusion via optimal transport, applies soft neuron association
to unify different pre-trained networks. Despite its effectiveness in saving computational …

Model Fusion via Neuron Transplantation

M Öz, N Kiefer, C Debus, J Hörter, A Streit… - … European Conference on …, 2024 - Springer
Ensemble learning is a widespread technique to improve the prediction performance of
neural networks. However, it comes at the price of increased memory and inference time. In …