A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Efficient Deep Learning Infrastructures for Embedded Computing Systems: A Comprehensive Survey and Future Envision

X Luo, D Liu, H Kong, S Huai, H Chen… - ACM Transactions on …, 2024 - dl.acm.org
Deep neural networks (DNNs) have recently achieved impressive success across a wide
range of real-world vision and language processing tasks, spanning from image …

LR-CNN: Lightweight Row-centric Convolutional Neural Network Training for Memory Reduction

Z Wang, H Yang, N Wang, C Xu, J Nie, Z Wei… - arXiv preprint arXiv …, 2024 - arxiv.org
In the last decade, Convolutional Neural Network with a multi-layer architecture has
advanced rapidly. However, training its complex network is very space-consuming, since a …

Rethinking Imbalance in Image Super-Resolution for Efficient Inference

W Yu, B Yang, Q Liu, J Li, S Zhang, X Ji - The Thirty-eighth Annual … - openreview.net
Existing super-resolution (SR) methods optimize all model weights equally using $\mathcal
{L} _1 $ or $\mathcal {L} _2 $ losses by uniformly sampling image patches without …