PLSR: Unstructured Pruning with Layer-Wise Sparsity Ratio

H Zhao, L Yu, R Guan, L Jia, J Zhang… - … on Machine Learning …, 2023 - ieeexplore.ieee.org
In the current era of multi-modal and large models gradually revealing their potential, neural
network pruning has emerged as a crucial means of model compression. It is widely …

Pruning deep neural networks from a sparsity perspective

E Diao, G Wang, J Zhan, Y Yang, J Ding… - arXiv preprint arXiv …, 2023 - arxiv.org
In recent years, deep network pruning has attracted significant attention in order to enable
the rapid deployment of AI into small devices with computation and memory constraints …

Pruning has a disparate impact on model accuracy

C Tran, F Fioretto, JE Kim… - Advances in neural …, 2022 - proceedings.neurips.cc
Network pruning is a widely-used compression technique that is able to significantly scale
down overparameterized models with minimal loss of accuracy. This paper shows that …

Sparsest Models Elude Pruning: An Exposé of Pruning's Current Capabilities

S Zhang, V Papyan - Forty-first International Conference on Machine … - openreview.net
Pruning has emerged as a promising approach for compressing large-scale models, yet its
effectiveness in recovering the sparsest of models has not yet been explored. We conducted …

[PDF][PDF] Supplementary: Pushing the Efficiency Limit Using Structured Sparse Convolutions

VK Verma, N Mehta, S Si, R Henao, L Carin - openaccess.thecvf.com
To show the efficacy of the proposed model we leverage three standard architectures VGG-
19 [5], ResNet-32 [1] and ResNet-50 [1]. We use these architecture on the small scale …

BMRS: Bayesian Model Reduction for Structured Pruning

D Wright, C Igel, R Selvan - arXiv preprint arXiv:2406.01345, 2024 - arxiv.org
Modern neural networks are often massively overparameterized leading to high compute
costs during training and at inference. One effective method to improve both the compute …

SPDY: Accurate pruning with speedup guarantees

E Frantar, D Alistarh - International Conference on Machine …, 2022 - proceedings.mlr.press
The recent focus on the efficiency of deep neural networks (DNNs) has led to significant
work on model compression approaches, of which weight pruning is one of the most …

LeanFlex-GKP: Advancing Hassle-Free Structured Pruning with Simple Flexible Group Count

J Zhang, S Zhong, A Ye, Z Liu, K Zhou… - … on Advancing Neural …, 2023 - openreview.net
Densely structured pruning methods—which generate pruned models in a fully dense
format, allowing immediate compression benefits without additional demands—are evolving …

[PDF][PDF] Effective methods for deep neural network sparsification

A Glentis Georgoulakis - 2024 - dspace.lib.ntua.gr
Abstract In recent years, Deep Neural Networks (DNNs) have significantly advanced the
state-of-the-art in numerous machine learning tasks. Unfortunately, most compact devices …

Survey of Model Pruning Algorithms

LI Yi, WEI Jian-guo, LIU Guan-wei - Computer and Modernization, 2022 - cam.org.cn
The model pruning algorithms apply different standards or methods to prune the redundant
neurons in the deep neural network, which can compress the model to the maximum extent …