Binaryduo: Reducing gradient mismatch in binary activation network by coupling binary activations

C Yuan, SS Agaian - Artificial Intelligence Review, 2023 - Springer

Deep learning (DL) has recently changed the development of intelligent systems and is
widely adopted in many real-life applications. Despite their various benefits and potentials …

被引用次数：118 相关文章所有 6 个版本

End-edge-cloud collaborative computing for deep learning: A comprehensive survey

Y Wang, C Yang, S Lan, L Zhu… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org

The booming development of deep learning applications and services heavily relies on
large deep learning models and massive data in the cloud. However, cloud-based deep …

被引用次数：30 相关文章

[PDF] arxiv.org

A survey of quantization methods for efficient neural network inference

A Gholami, S Kim, Z Dong, Z Yao… - Low-Power Computer …, 2022 - taylorfrancis.com

This chapter provides approaches to the problem of quantizing the numerical values in deep
Neural Network computations, covering the advantages/disadvantages of current methods …

被引用次数：1329 相关文章所有 4 个版本

[PDF] arxiv.org

Binarybert: Pushing the limit of bert quantization

H Bai, W Zhang, L Hou, L Shang, J Jin, X Jiang… - arXiv preprint arXiv …, 2020 - arxiv.org

The rapid development of large pre-trained language models has greatly increased the
demand for model compression techniques, among which quantization is a popular solution …

被引用次数：238 相关文章所有 4 个版本

[PDF] springer.com

Loss aware post-training quantization

Y Nahshan, B Chmiel, C Baskin, E Zheltonozhskii… - Machine Learning, 2021 - Springer

Neural network quantization enables the deployment of large models on resource-
constrained devices. Current post-training quantization methods fall short in terms of …

被引用次数：187 相关文章所有 8 个版本

[PDF] ieee.org

A systematic literature review on binary neural networks

R Sayed, H Azmi, H Shawkey, AH Khalil… - IEEE Access, 2023 - ieeexplore.ieee.org

This paper presents an extensive literature review on Binary Neural Network (BNN). BNN
utilizes binary weights and activation function parameters to substitute the full-precision …

被引用次数：29 相关文章所有 3 个版本

[PDF] aaai.org

Spiking neural networks with improved inherent recurrence dynamics for sequential learning

W Ponghiran, K Roy - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org

Spiking neural networks (SNNs) with leaky integrate and fire (LIF) neurons, can be operated
in an event-driven manner and have internal states to retain information over time, providing …

被引用次数：48 相关文章所有 7 个版本

[PDF] thecvf.com

Improving accuracy of binary neural networks using unbalanced activation distribution

H Kim, J Park, C Lee, JJ Kim - Proceedings of the IEEE/CVF …, 2021 - openaccess.thecvf.com

Binarization of neural network models is considered as one of the promising methods to
deploy deep neural network models on resource-constrained environments such as mobile …

被引用次数：48 相关文章所有 5 个版本

[PDF] iop.org

Investigating the effect of circuit cutting in QAOA for the MaxCut problem on NISQ devices

M Bechtold, J Barzen, F Leymann… - Quantum Science …, 2023 - iopscience.iop.org

Noisy intermediate-scale quantum (NISQ) devices are restricted by their limited number of
qubits and their short decoherence times. An approach addressing these problems is …

被引用次数：27 相关文章所有 4 个版本

[PDF] aaai.org

Ompq: Orthogonal mixed precision quantization

Y Ma, T Jin, X Zheng, Y Wang, H Li, Y Wu… - Proceedings of the …, 2023 - ojs.aaai.org

To bridge the ever-increasing gap between deep neural networks' complexity and hardware
capability, network quantization has attracted more and more research attention. The latest …

被引用次数：36 相关文章所有 4 个版本

高级搜索

QQ 群