作者
Bert Moons, Koen Goetschalckx, Nick Van Berckelaer, Marian Verhelst
发表日期
2017/10/29
研讨会论文
2017 51st Asilomar Conference on Signals, Systems, and Computers
页码范围
1921-1925
出版商
IEEE
简介
This work targets the automated minimum-energy optimization of Quantized Neural Networks (QNNs) - networks using low precision weights and activations. These networks are trained from scratch at an arbitrary fixed point precision. At iso-accuracy, QNNs using fewer bits require deeper and wider network architectures than networks using higher precision operators, while they require less complex arithmetic and less bits per weights. This fundamental trade-off is analyzed and quantified to find the minimum energy QNN for any benchmark and hence optimize energy-efficiency. To this end, the energy consumption of inference is modeled for a generic hardware platform. This allows drawing several conclusions across different benchmarks. First, energy consumption varies orders of magnitude at iso-accuracy depending on the number of bits used in the QNN. Second, in a typical system, BinaryNets or int4 …
引用总数
201820192020202120222023202410213131283016
学术搜索中的文章
B Moons, K Goetschalckx, N Van Berckelaer… - 2017 51st Asilomar Conference on Signals, Systems …, 2017