作者
Bert Moons, Marian Verhelst
发表日期
2016/6/15
研讨会论文
2016 IEEE Symposium on VLSI Circuits (VLSI-Circuits)
页码范围
1-2
出版商
IEEE
简介
A low-power precision-scalable processor for ConvNets or convolutional neural networks (CNN) is implemented in a 40nm technology. Its 256 parallel processing units achieve a peak 102GOPS running at 204MHz. To minimize energy consumption while maintaining throughput, this works is the first to both exploit the sparsity of convolutions and to implement dynamic precision-scalability enabling supply- and energy scaling. The processor is fully C-programmable, consumes 25-288mW at 204 MHz and scales efficiency from 0.3-2.6 real TOPS/W. This system hereby outperforms the state-of-the-art up to 3.9× in energy efficiency.
引用总数
2016201720182019202020212022202320243323643363017164
学术搜索中的文章