A survey of SRAM-based in-memory computing techniques and applications

S Mittal, G Verma, B Kaushik, FA Khanday - Journal of Systems …, 2021 - Elsevier
As von Neumann computing architectures become increasingly constrained by data-
movement overheads, researchers have started exploring in-memory computing (IMC) …

A systematic literature review on binary neural networks

R Sayed, H Azmi, H Shawkey, AH Khalil… - IEEE Access, 2023 - ieeexplore.ieee.org
This paper presents an extensive literature review on Binary Neural Network (BNN). BNN
utilizes binary weights and activation function parameters to substitute the full-precision …

Evaluating machine learning workloads on memory-centric computing systems

J Gómez-Luna, Y Guo, S Brocard… - … Analysis of Systems …, 2023 - ieeexplore.ieee.org
Training machine learning (ML) algorithms is a computationally intensive process, which is
frequently memory-bound due to repeatedly accessing large training datasets. As a result …

CIMAT: A compute-in-memory architecture for on-chip training based on transpose SRAM arrays

H Jiang, X Peng, S Huang, S Yu - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Rapid development in deep neural networks (DNNs) is enabling many intelligent
applications. However, on-chip training of DNNs is challenging due to the extensive …

Accelerating deep neural network in-situ training with non-volatile and volatile memory based hybrid precision synapses

Y Luo, S Yu - IEEE Transactions on Computers, 2020 - ieeexplore.ieee.org
Compute-in-memory (CIM) with emerging non-volatile memories (eNVMs) is time and
energy efficient for deep neural network (DNN) inference. However, challenges still remain …

A two-way SRAM array based accelerator for deep neural network on-chip training

H Jiang, S Huang, X Peng, JW Su… - 2020 57th ACM/IEEE …, 2020 - ieeexplore.ieee.org
On-chip training of large-scale deep neural networks (DNNs) is challenging due to
computational complexity and resource limitation. Compute-in-memory (CIM) architecture …

A review on SRAM-based computing in-memory: Circuits, functions, and applications

Z Lin, Z Tong, J Zhang, F Wang, T Xu… - Journal of …, 2022 - iopscience.iop.org
Artificial intelligence (AI) processes data-centric applications with minimal effort. However, it
poses new challenges to system design in terms of computational speed and energy …

An Experimental Evaluation of Machine Learning Training on a Real Processing-in-Memory System

J Gómez-Luna, Y Guo, S Brocard, J Legriel… - arXiv preprint arXiv …, 2022 - arxiv.org
Training machine learning (ML) algorithms is a computationally intensive process, which is
frequently memory-bound due to repeatedly accessing large training datasets. As a result …

Cambricon-U: A systolic random increment memory architecture for unary computing

H Guo, Y Zhao, Z Li, Y Hao, C Liu, X Song, X Li… - Proceedings of the 56th …, 2023 - dl.acm.org
Unary computing, whose arithmetics require only one logic gate, has enabled efficient DNN
processing, especially on strictly power-constrained devices. However, unary computing still …

PIM-Opt: Demystifying Distributed Optimization Algorithms on a Real-World Processing-In-Memory System

S Rhyner, H Luo, J Gómez-Luna… - Proceedings of the …, 2024 - dl.acm.org
Modern Machine Learning (ML) training on large-scale datasets is a very time-consuming
workload. It relies on the optimization algorithm Stochastic Gradient Descent (SGD) due to …