A survey on deep neural network pruning: Taxonomy, comparison, analysis, and recommendations

H Cheng, M Zhang, JQ Shi - IEEE Transactions on Pattern …, 2024 - ieeexplore.ieee.org
Modern deep neural networks, particularly recent large language models, come with
massive model sizes that require significant computational and storage resources. To …

Structured pruning for deep convolutional neural networks: A survey

Y He, L Xiao - IEEE Transactions on Pattern Analysis and …, 2023 - ieeexplore.ieee.org
The remarkable performance of deep convolutional neural networks (CNNs) is generally
attributed to their deeper and wider architectures, which can come with significant …

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Rethinking attention with performers

K Choromanski, V Likhosherstov, D Dohan… - arXiv preprint arXiv …, 2020 - arxiv.org
We introduce Performers, Transformer architectures which can estimate regular (softmax)
full-rank-attention Transformers with provable accuracy, but using only linear (as opposed to …

Pruning neural networks without any data by iteratively conserving synaptic flow

H Tanaka, D Kunin, DL Yamins… - Advances in neural …, 2020 - proceedings.neurips.cc
Pruning the parameters of deep neural networks has generated intense interest due to
potential savings in time, memory and energy both during training and at test time. Recent …

The lottery ticket hypothesis for pre-trained bert networks

T Chen, J Frankle, S Chang, S Liu… - Advances in neural …, 2020 - proceedings.neurips.cc
In natural language processing (NLP), enormous pre-trained models like BERT have
become the standard starting point for training on a range of downstream tasks, and similar …

On the effectiveness of parameter-efficient fine-tuning

Z Fu, H Yang, AMC So, W Lam, L Bing… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Fine-tuning pre-trained models has been ubiquitously proven to be effective in a wide range
of NLP tasks. However, fine-tuning the whole model is parameter inefficient as it always …

Chasing sparsity in vision transformers: An end-to-end exploration

T Chen, Y Cheng, Z Gan, L Yuan… - Advances in Neural …, 2021 - proceedings.neurips.cc
Vision transformers (ViTs) have recently received explosive popularity, but their enormous
model sizes and training costs remain daunting. Conventional post-training pruning often …

A unified lottery ticket hypothesis for graph neural networks

T Chen, Y Sui, X Chen, A Zhang… - … conference on machine …, 2021 - proceedings.mlr.press
With graphs rapidly growing in size and deeper graph neural networks (GNNs) emerging,
the training and inference of GNNs become increasingly expensive. Existing network weight …

Sparse training via boosting pruning plasticity with neuroregeneration

S Liu, T Chen, X Chen, Z Atashgahi… - Advances in …, 2021 - proceedings.neurips.cc
Work on the lottery ticket hypothesis (LTH) and single-shot network pruning (SNIP) has
drawn considerable recent attention to post-training pruning (iterative magnitude pruning) and before …