FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search

J Dotzel, G Wu, A Li, M Umar, Y Ni… - arXiv preprint arXiv …, 2023 - arxiv.org
Quantization has become a mainstream compression technique for reducing model size,
computational requirements, and energy consumption for modern deep neural networks …

PL-NPU: An energy-efficient edge-device DNN training processor with posit-based logarithm-domain computing

Y Wang, D Deng, L Liu, S Wei… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Edge-device deep neural network (DNN) training is a practical way to improve model adaptivity for
unfamiliar datasets while avoiding privacy disclosure and huge communication costs …

On-device deep learning: survey on techniques improving energy efficiency of DNNs

A Boumendil, W Bechkit… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Providing high-quality predictions is no longer the sole goal for neural networks. As we live
in an increasingly interconnected world, these models need to match the constraints of …

Gradient distribution-aware INT8 training for neural networks

S Wang, Y Kang - Neurocomputing, 2023 - Elsevier
Recently, low bit-width quantization (e.g., INT8) has been commonly used in deep neural
network inference acceleration, but fewer researchers have focused on low-precision …

A 4.27 TFLOPS/W FP4/FP8 Hybrid-Precision Neural Network Training Processor Using Shift-Add MAC and Reconfigurable PE Array

S Lee, J Park, D Jeon - … 2023 - IEEE 49th European Solid State …, 2023 - ieeexplore.ieee.org
This paper presents an energy-efficient FP4/FP8 hybrid-precision training processor.
Through hardware-software co-optimization, the design efficiently implements all general …

Exploiting the Partly Scratch-off Lottery Ticket for Quantization-Aware Training

Y Zhong, G Nan, Y Zhang, F Chao, R Ji - arXiv preprint arXiv:2211.08544, 2022 - arxiv.org
Quantization-aware training (QAT) has gained extensive popularity as it retains the
performance of quantized networks well. In QAT, the contemporary experience is that all …

Hadamard Domain Training with Integers for Class Incremental Quantized Learning

M Schiemer, CJS Schaefer, JP Vap, MJ Horeni… - arXiv preprint arXiv …, 2023 - arxiv.org
Continual learning is a desirable feature in many modern machine learning applications,
which allows in-field adaptation and updating, ranging from accommodating distribution …

Systematic Analysis of Low-Precision Training in Deep Neural Networks: Factors Influencing Matrix Computations.

A Shen, Z Lai, L Zhang - Applied Sciences (2076-3417), 2024 - search.ebscohost.com
As Deep Neural Networks (DNNs) continue to increase in complexity, the
computational demands of their training have become a significant bottleneck. Low …

SpQAT: A Sparse Quantization-Aware Training Method

Y Zhong, M Lin, G Nan, F Chao, R Ji - openreview.net
Quantization-aware training (QAT) has been demonstrated not only to reduce computational
cost and storage footprint, but also to retain the performance of full-precision neural networks …

Multi-modal data modeling with awareness of efficiency, reliability, and privacy

P Zhang - 2023 - espace.library.uq.edu.au
Foundational to the digital economy, data and its accompanying analytical models today
play a critical role in the gaining of new insights from the digital world and the facilitation of …