Challenges and trends of nonvolatile in-memory-computation circuits for AI edge devices

JM Hung, CJ Jhang, PC Wu, YC Chiu… - IEEE Open Journal of …, 2021 - ieeexplore.ieee.org
Nonvolatile memory (NVM)-based computing-in-memory (nvCIM) is a promising candidate
for artificial intelligence (AI) edge devices to overcome the latency and energy consumption …

Micro/Nano Circuits and Systems Design and Design Automation: Challenges and Opportunities [Point of View]

G Cauwenberghs, J Cong, XS Hu, S Joshi… - Proceedings of the …, 2023 - ieeexplore.ieee.org
The field of design and design automation of micro-/nano-circuits and systems has played a
pivotal role in advancing information technologies that are an inseparable part of all our …

CHIMERA: A 0.92-TOPS, 2.2-TOPS/W edge AI accelerator with 2-MByte on-chip foundry resistive RAM for efficient training and inference

K Prabhu, A Gural, ZF Khan… - IEEE Journal of Solid …, 2022 - ieeexplore.ieee.org
Implementing edge artificial intelligence (AI) inference and training is challenging with
current memory technologies. As deep neural networks (DNNs) grow in size, this problem is …

Nvmexplorer: A framework for cross-stack comparisons of embedded non-volatile memories

L Pentecost, A Hankin, M Donato, M Hempstead… - arXiv preprint arXiv …, 2021 - arxiv.org
Repeated off-chip memory accesses to DRAM drive up operating power for data-intensive
applications, and SRAM technology scaling and leakage power limits the efficiency of …

RRAM-DNN: An RRAM and model-compression empowered all-weights-on-chip DNN accelerator

Z Li, Z Wang, L Xu, Q Dong, B Liu, CI Su… - IEEE Journal of Solid …, 2020 - ieeexplore.ieee.org
This article presents an energy-efficient deep neural network (DNN) accelerator with non-
volatile embedded resistive random access memory (RRAM) for mobile machine learning …

RoboVisio: A Micro-Robot Vision Domain-Specific SoC for Autonomous Navigation Enabling Fully-on-Chip Intelligence via 2-MB eMRAM

Q Zhang, Z Fan, H An, Z Wang, Z Li… - IEEE Journal of Solid …, 2024 - ieeexplore.ieee.org
This article presents RoboVisio, an efficient and highly flexible domain-specific system-on-
chip (SoC) for vision tasks in fully autonomous micro-robot navigation. A novel hybrid …

Synchronous weight quantization-compression for low-bit quantized neural network

Y Jiao, S Li, X Huo, YK Li - 2021 International Joint Conference …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) usually have multiple layers and thousands of trainable
parameters to ensure high accuracy. Due to the requirement of large amounts of …

[PDF][PDF] Computational Memory Design

R Khaddam-Aljameh - 2022 - research-collection.ethz.ch
In recent years, we have witnessed explosive growth in machine learning and deep learning-
related applications. In particular, deep artificial neural networks (DNNs) have shown …

Domain-Specific Acceleration: From Efficient Vision Processing Hardware to High-Performance Quantum Computing Software

Q Zhang - 2024 - deepblue.lib.umich.edu
With the end of Dennard scaling and the decline of Moore's law, there are no longer
'free'performance and efficiency gains from semiconductor technology advancements …

Architecting High Performance Silicon Systems for Accurate and Efficient On-Chip Deep Learning

T Tambe - 2023 - search.proquest.com
The unabated pursuit of omniscient and omnipotent AI is levying hefty latency, memory, and
energy taxes at all computing scales. At the same time, the twilight of Dennard scaling …