Structured pruning for deep convolutional neural networks: A survey

Y He, L Xiao - IEEE transactions on pattern analysis and …, 2023 - ieeexplore.ieee.org
The remarkable performance of deep Convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

Transforming large-size to lightweight deep neural networks for IoT applications

R Mishra, H Gupta - ACM Computing Surveys, 2023 - dl.acm.org
Deep Neural Networks (DNNs) have gained unprecedented popularity due to their high-
order performance and automated feature extraction capability. This has encouraged …

Spvit: Enabling faster vision transformers via latency-aware soft token pruning

Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun… - European conference on …, 2022 - Springer
Abstract Recently, Vision Transformer (ViT) has continuously established new milestones in
the computer vision field, while the high computation and memory cost makes its …

Mest: Accurate and fast memory-economic sparse training framework on the edge

G Yuan, X Ma, W Niu, Z Li, Z Kong… - Advances in …, 2021 - proceedings.neurips.cc
Recently, a new trend of exploring sparsity for accelerating neural network training has
emerged, embracing the paradigm of training on the edge. This paper proposes a novel …

Chex: Channel exploration for cnn model compression

Z Hou, M Qin, F Sun, X Ma, K Yuan… - Proceedings of the …, 2022 - openaccess.thecvf.com
Channel pruning has been broadly recognized as an effective technique to reduce the
computation and memory cost of deep convolutional neural networks. However …

Forms: Fine-grained polarized reram-based in-situ computation for mixed-signal dnn accelerator

G Yuan, P Behnam, Z Li, A Shafiee… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Recent work demonstrated the promise of using resistive random access memory (ReRAM)
as an emerging technology to perform inherently parallel analog domain in-situ matrix …

Accelerating federated learning for iot in big data analytics with pruning, quantization and selective updating

W Xu, W Fang, Y Ding, M Zou, N Xiong - IEEE Access, 2021 - ieeexplore.ieee.org
The ever-increasing number of Internet of Things (IoT) devices are continuously generating
huge masses of data, but the current cloud-centric approach for IoT big data analysis has …

Hardware acceleration of sparse and irregular tensor computations of ml models: A survey and insights

S Dave, R Baghdadi, T Nowatzki… - Proceedings of the …, 2021 - ieeexplore.ieee.org
Machine learning (ML) models are widely used in many important domains. For efficiently
processing these computational-and memory-intensive applications, tensors of these …

Teachers do more than teach: Compressing image-to-image models

Q Jin, J Ren, OJ Woodford, J Wang… - Proceedings of the …, 2021 - openaccess.thecvf.com
Abstract Generative Adversarial Networks (GANs) have achieved huge success in
generating high-fidelity images, however, they suffer from low efficiency due to tremendous …

Achieving on-mobile real-time super-resolution with neural architecture and pruning search

Z Zhan, Y Gong, P Zhao, G Yuan… - Proceedings of the …, 2021 - openaccess.thecvf.com
Though recent years have witnessed remarkable progress in single image super-resolution
(SISR) tasks with the prosperous development of deep neural networks (DNNs), the deep …