A review of convolutional neural network architectures and their optimizations

S Cong, Y Zhou - Artificial Intelligence Review, 2023 - Springer
The research advances concerning the typical architectures of convolutional neural
networks (CNNs), as well as their optimizations, are analyzed and elaborated in detail in this …

A comprehensive survey on model compression and acceleration

T Choudhary, V Mishra, A Goswami… - Artificial Intelligence …, 2020 - Springer
In recent years, machine learning (ML) and deep learning (DL) have shown remarkable
improvement in computer vision, natural language processing, stock prediction, forecasting …

LLM-Pruner: On the structural pruning of large language models

X Ma, G Fang, X Wang - Advances in Neural Information …, 2023 - proceedings.neurips.cc
Large language models (LLMs) have shown remarkable capabilities in language
understanding and generation. However, such impressive capability typically comes with a …

Blueprint separable residual network for efficient image super-resolution

Z Li, Y Liu, X Chen, H Cai, J Gu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Recent advances in single image super-resolution (SISR) have achieved extraordinary
performance, but the computational cost is too heavy for deployment on edge devices. To alleviate …

Sparsity in deep learning: Pruning and growth for efficient inference and training in neural networks

T Hoefler, D Alistarh, T Ben-Nun, N Dryden… - Journal of Machine …, 2021 - jmlr.org
The growing energy and performance costs of deep learning have driven the community to
reduce the size of neural networks by selectively pruning components. Similarly to their …

Pruning and quantization for deep neural network acceleration: A survey

T Liang, J Glossner, L Wang, S Shi, X Zhang - Neurocomputing, 2021 - Elsevier
Deep neural networks have been deployed in many applications, exhibiting extraordinary
abilities in the field of computer vision. However, complex network architectures challenge …

Not all patches are what you need: Expediting vision transformers via token reorganizations

Y Liang, C Ge, Z Tong, Y Song, J Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Vision Transformers (ViTs) take all the image patches as tokens and construct multi-head
self-attention (MHSA) among them. Fully leveraging these image tokens brings …

Toward transparent AI: A survey on interpreting the inner structures of deep neural networks

T Räuker, A Ho, S Casper… - 2023 IEEE Conference …, 2023 - ieeexplore.ieee.org
The last decade of machine learning has seen drastic increases in scale and capabilities.
Deep neural networks (DNNs) are increasingly being deployed in the real world. However …

Binary neural networks: A survey

H Qin, R Gong, X Liu, X Bai, J Song, N Sebe - Pattern Recognition, 2020 - Elsevier
The binary neural network, which largely saves storage and computation, serves as a
promising technique for deploying deep models on resource-limited devices. However, the …

A fast post-training pruning framework for transformers

W Kwon, S Kim, MW Mahoney… - Advances in …, 2022 - proceedings.neurips.cc
Pruning is an effective way to reduce the huge inference cost of Transformer models.
However, prior work on pruning Transformers requires retraining the models. This can add …