Large-scale deep neural networks (DNNs) are both compute- and memory-intensive. As the size of DNNs continues to grow, it is critical to improve the energy efficiency and …
In natural language processing (NLP), the "Transformer" architecture was proposed as the first transduction model relying entirely on self-attention mechanisms without using …
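For reference, a minimal sketch of the scaled dot-product self-attention at the core of such an architecture (NumPy only; the projection matrices Wq, Wk, Wv, the single-head setup, and the toy shapes are illustrative assumptions, not details taken from the cited work):

```python
import numpy as np

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product self-attention.

    X: (seq_len, d_model) input embeddings.
    Wq, Wk, Wv: (d_model, d_k) projection matrices (illustrative).
    """
    Q, K, V = X @ Wq, X @ Wk, X @ Wv            # project inputs to queries/keys/values
    scores = Q @ K.T / np.sqrt(K.shape[-1])     # pairwise attention logits, scaled by sqrt(d_k)
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over the key dimension
    return weights @ V                          # attention-weighted sum of values

# Toy usage with random data.
rng = np.random.default_rng(0)
X = rng.standard_normal((5, 16))
Wq, Wk, Wv = (rng.standard_normal((16, 8)) for _ in range(3))
out = self_attention(X, Wq, Wk, Wv)             # shape (5, 8)
```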
Recently, significant accuracy improvements have been achieved for acoustic recognition systems by increasing the model size of Long Short-Term Memory (LSTM) networks …
Deep neural networks (DNNs), as the basis of object detection, will play a key role in the development of future fully autonomous systems. The autonomous systems …
Recurrent Neural Networks (RNNs) are becoming increasingly important for time-series-related applications that require efficient and real-time implementations. The two major …
We present an efficient coresets-based neural network compression algorithm that sparsifies the parameters of a trained fully-connected neural network in a manner that provably …
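As a rough illustration of coreset-style sparsification, the sketch below keeps a fixed budget of weights of a fully-connected layer by importance sampling and reweights them so the layer is preserved in expectation. The magnitude-proportional sampling probabilities and the function name are simplifying assumptions for this sketch; the cited work derives data-dependent sensitivities and formal guarantees.

```python
import numpy as np

def sparsify_layer(W, m, rng=np.random.default_rng(0)):
    """Keep at most m weights of a layer via importance sampling.

    Each entry is sampled with probability proportional to its magnitude
    (an assumption made here for simplicity) and re-scaled by 1/(m * p)
    so that the sparsified matrix equals W in expectation.
    """
    p = np.abs(W).ravel()
    p = p / p.sum()                                   # sampling distribution over entries
    idx = rng.choice(W.size, size=m, replace=True, p=p)
    keep = np.zeros(W.size)
    np.add.at(keep, idx, W.ravel()[idx] / (m * p[idx]))  # unbiased reweighting of sampled entries
    return keep.reshape(W.shape)

W = np.random.default_rng(1).standard_normal((256, 128))
W_hat = sparsify_layer(W, m=4000)                     # at most 4000 non-zero entries remain
```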
Weight pruning methods of deep neural networks (DNNs) have been demonstrated to achieve a good model pruning ratio without loss of accuracy, thereby alleviating the …
Weight pruning methods of deep neural networks (DNNs) have been demonstrated to achieve a good model pruning rate without loss of accuracy, thereby alleviating the …
Many model compression techniques for Deep Neural Networks (DNNs) have been investigated, including weight pruning, weight clustering, and quantization. Weight …
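To make two of these techniques concrete, the following sketch applies global magnitude pruning and then simple weight clustering (one-dimensional k-means) to a weight matrix. The sparsity level, cluster count, initialization, and iteration count are illustrative choices, not the specific methods of the works listed above.

```python
import numpy as np

def magnitude_prune(W, sparsity=0.9):
    """Zero out the smallest-magnitude weights so that `sparsity` of them become zero."""
    thresh = np.quantile(np.abs(W), sparsity)
    return np.where(np.abs(W) >= thresh, W, 0.0)

def cluster_weights(W, k=16, iters=20):
    """Share weights by snapping each non-zero entry to one of k centroids (1-D k-means)."""
    nz = W[W != 0.0]
    centroids = np.linspace(nz.min(), nz.max(), k)        # simple linear initialization
    for _ in range(iters):
        assign = np.abs(nz[:, None] - centroids[None, :]).argmin(axis=1)
        for c in range(k):
            if np.any(assign == c):
                centroids[c] = nz[assign == c].mean()     # move centroid to its cluster mean
    Wq = W.copy()
    Wq[W != 0.0] = centroids[assign]                      # replace weights with shared values
    return Wq

W = np.random.default_rng(2).standard_normal((64, 64))
Wq = cluster_weights(magnitude_prune(W, 0.9), k=16)       # pruned, then clustered weights
```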