RAELLA: Reforming the arithmetic for efficient, low-resolution, and low-loss analog PIM: No retraining required!

T Andrulis, JS Emer, V Sze - … of the 50th Annual International Symposium …, 2023 - dl.acm.org
Processing-In-Memory (PIM) accelerators have the potential to efficiently run Deep Neural
Network (DNN) inference by reducing costly data movement and by using resistive RAM …

Inca: Input-stationary dataflow at outside-the-box thinking about deep learning accelerators

B Kim, S Li, H Li - 2023 IEEE International Symposium on High …, 2023 - ieeexplore.ieee.org
This paper first presents an input-stationary (IS) implemented crossbar accelerator (INCA),
supporting inference and training for deep neural networks (DNNs). Processing-in-memory …

Era-bs: Boosting the efficiency of reram-based pim accelerator with fine-grained bit-level sparsity

F Liu, W Zhao, Z Wang, Y Chen… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Resistive Random-Access-Memory (ReRAM) crossbar is one of the most promising neural
network accelerators, thanks to its in-memory and in-situ analog computing abilities for …

Subgraph stationary hardware-software inference co-design

P Behnam, A Tumanov, T Krishna… - Proceedings of …, 2023 - proceedings.mlsys.org
A growing number of applications depend on Machine Learning (ML) functionality and
benefits from both higher quality ML predictions and better timeliness (latency) at the same …

Resistive neural hardware accelerators

K Smagulova, ME Fouda, F Kurdahi… - Proceedings of the …, 2023 - ieeexplore.ieee.org
Deep neural networks (DNNs), as a subset of machine learning (ML) techniques, entail that
real-world data can be learned, and decisions can be made in real time. However, their wide …

Energy-efficient reram-based ml training via mixed pruning and reconfigurable adc

C Ogbogu, M Soumen, BK Joardar… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Machine learning (ML) models have gained prominence in solving real-world tasks.
However, implementing ML models is both compute-and memory-intensive. Domain-specific …

RACE-IT: A reconfigurable analog CAM-crossbar engine for in-memory transformer acceleration

L Zhao, L Buonanno, RM Roth, S Serebryakov… - arXiv preprint arXiv …, 2023 - arxiv.org
Transformer models represent the cutting edge of Deep Neural Networks (DNNs) and excel
in a wide range of machine learning tasks. However, processing these models demands …

3A-ReRAM: Adaptive Activation Accumulation in ReRAM-Based CNN Accelerator

Z Zhang, J Jiang, Q Wang, Z Mao… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
ReRAM-based computing is good at accelerating convolutional neural network (CNN)
inference due to its high computing parallelism, but its rigid crossbar structure may become …

[HTML][HTML] PRAP-PIM: A weight pattern reusing aware pruning method for ReRAM-based PIM DNN accelerators

Z Shen, J Wu, X Jiang, Y Zhang, L Ju, Z Jia - High-Confidence Computing, 2023 - Elsevier
Abstract Resistive Random-Access Memory (ReRAM) based Processing-in-Memory (PIM)
frameworks are proposed to accelerate the working process of DNN models by eliminating …

Redy: A novel reram-centric dynamic quantization approach for energy-efficient cnn inference

M Sabri, M Riera, A González - arXiv preprint arXiv:2306.16298, 2023 - arxiv.org
The primary operation in DNNs is the dot product of quantized input activations and weights.
Prior works have proposed the design of memory-centric architectures based on the …