A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Machine-learning-enabled cooperative perception for connected autonomous vehicles: Challenges and opportunities

Q Yang, S Fu, H Wang, H Fang - IEEE Network, 2021 - ieeexplore.ieee.org
Connected and autonomous vehicles is a disruptive technology that has the potential to
transform the current transportation system by reducing traffic accidents and enhancing …

Toward transparent ai: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 ieee conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

Revisiting random channel pruning for neural network compression

Y Li, K Adamczewski, W Li, S Gu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Channel (or 3D filter) pruning serves as an effective way to accelerate the inference of
neural networks. There has been a flurry of algorithms that try to solve this practical problem …

Fedmask: Joint computation and communication-efficient personalized federated learning via heterogeneous masking

A Li, J Sun, X Zeng, M Zhang, H Li, Y Chen - Proceedings of the 19th …, 2021 - dl.acm.org
Recent advancements in deep neural networks (DNN) enabled various mobile deep
learning applications. However, it is technically challenging to locally train a DNN model due …

Group sparsity: The hinge between filter pruning and decomposition for network compression

Y Li, S Gu, C Mayer, LV Gool… - Proceedings of the …, 2020 - openaccess.thecvf.com
In this paper, we analyze two popular network compression techniques, ie filter pruning and
low-rank decomposition, in a unified sense. By simply changing the way the sparsity …

Only train once: A one-shot neural network training and pruning framework

T Chen, B Ji, T Ding, B Fang, G Wang… - Advances in …, 2021 - proceedings.neurips.cc
Structured pruning is a commonly used technique in deploying deep neural networks
(DNNs) onto resource-constrained devices. However, the existing pruning methods are …

Methods for pruning deep neural networks

S Vadera, S Ameen - IEEE Access, 2022 - ieeexplore.ieee.org
This paper presents a survey of methods for pruning deep neural networks. It begins by
categorising over 150 studies based on the underlying approach used and then focuses on …

Dhp: Differentiable meta pruning via hypernetworks

Y Li, S Gu, K Zhang, L Van Gool, R Timofte - Computer Vision–ECCV 2020 …, 2020 - Springer
Network pruning has been the driving force for the acceleration of neural networks and the
alleviation of model storage/transmission burden. With the advent of AutoML and neural …

Multi-scale aligned distillation for low-resolution detection

L Qi, J Kuen, J Gu, Z Lin, Y Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
In instance-level detection tasks (eg, object detection), reducing input resolution is an easy
option to improve runtime efficiency. However, this option severely hurts the detection …