An energy-efficient matrix multiplication accelerator by distributed in-memory computing on binary RRAM crossbar

L Ni, Y Wang, H Yu, W Yang, C Weng… - 2016 21st Asia and …, 2016 - ieeexplore.ieee.org
L Ni, Y Wang, H Yu, W Yang, C Weng, J Zhao
2016 21st Asia and South Pacific Design Automation Conference (ASP …, 2016ieeexplore.ieee.org
Emerging resistive random-access memory (RRAM) can provide non-volatile memory
storage but also intrinsic logic for matrix-vector multiplication, which is ideal for low-power
and high-throughput data analytics accelerator performed in memory. However, the existing
RRAM-based computing device is mainly assumed on a multi-level analog computing,
whose result is sensitive to process non-uniformity as well as additional AD-conversion and
I/O overhead. This paper explores the data analytics accelerator on binary RRAM-crossbar …
Emerging resistive random-access memory (RRAM) can provide non-volatile memory storage but also intrinsic logic for matrix-vector multiplication, which is ideal for low-power and high-throughput data analytics accelerator performed in memory. However, the existing RRAM-based computing device is mainly assumed on a multi-level analog computing, whose result is sensitive to process non-uniformity as well as additional AD- conversion and I/O overhead. This paper explores the data analytics accelerator on binary RRAM-crossbar. Accordingly, one distributed in-memory computing architecture is proposed with design of according component and control protocol. Both memory array and logic accelerator can be implemented by RRAM-crossbar purely in binary, where logic-memory pairs can be distributed with protocol of control bus. Based on numerical results for fingerprint matching that is mapped on the proposed RRAM-crossbar, the proposed architecture has shown 2.86x faster speed, 154x better energy efficiency, and 100x smaller area when compared to the same design by CMOS-based ASIC.
ieeexplore.ieee.org
以上显示的是最相近的搜索结果。 查看全部搜索结果