Authors
Tao Lin, Sebastian U. Stich, Luis Barba, Daniil Dmitriev, Martin Jaggi
Publication date
2020
Journal
ICLR 2020 - International Conference on Learning Representations
Description
Deep neural networks often have millions of parameters. This can hinder their deployment to low-end devices, not only due to high memory requirements but also because of increased latency at inference. We propose a novel model compression method that generates a sparse trained model without additional overhead: by allowing (i) dynamic allocation of the sparsity pattern and (ii) incorporating a feedback signal to reactivate prematurely pruned weights we obtain a performant sparse model in one single training pass (retraining is not needed, but can further improve the performance). We evaluate our method on CIFAR-10 and ImageNet, and show that the obtained sparse models can reach the state-of-the-art performance of dense models. Moreover, their performance surpasses that of models generated by all previously proposed pruning schemes.
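To make the two ingredients named in the abstract concrete, here is a minimal NumPy sketch of one training step that (i) re-derives the sparsity pattern dynamically from the current weights and (ii) applies gradients to a dense weight copy so that pruned weights keep receiving feedback and can be reactivated. The function names (`magnitude_mask`, `train_step`), the magnitude-based pruning criterion, and the hyperparameters are illustrative assumptions, not the authors' exact algorithm.

```python
import numpy as np

def magnitude_mask(w, sparsity):
    """Keep the largest-magnitude entries of w; zero out the rest (assumed criterion)."""
    k = int(np.ceil((1.0 - sparsity) * w.size))
    if k == 0:
        return np.zeros_like(w)
    threshold = np.sort(np.abs(w).ravel())[-k]
    return (np.abs(w) >= threshold).astype(w.dtype)

def train_step(w_dense, grad_fn, lr=0.1, sparsity=0.9):
    """One sketch step of sparse training with feedback.

    (i)  The mask is recomputed from the current dense weights,
         so the sparsity pattern can change between steps.
    (ii) The gradient is evaluated at the pruned weights but applied
         to the dense copy, so prematurely pruned weights still get
         a feedback signal and may re-enter the mask later.
    """
    mask = magnitude_mask(w_dense, sparsity)   # (i) dynamic sparsity pattern
    w_sparse = mask * w_dense                  # pruned model used for the loss
    grad = grad_fn(w_sparse)                   # gradient at the sparse point
    w_dense = w_dense - lr * grad              # (ii) update the dense copy
    return w_dense, w_sparse

# Toy usage (hypothetical objective): minimize ||w - target||^2 at 50% sparsity.
target = np.array([3.0, -0.1, 2.0, 0.05])
w = np.random.randn(4)
for _ in range(200):
    w, w_sparse = train_step(w, lambda v: 2 * (v - target), lr=0.1, sparsity=0.5)
```

In this sketch `w_sparse` is the model that would be deployed after training, while `w_dense` only exists to accumulate the feedback during training.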
Total citations
[Citations-per-year chart, 2019–2024]
Scholar articles
T Lin, SU Stich, L Barba, D Dmitriev, M Jaggi - arXiv preprint arXiv:2006.07253, 2020