Spvit: Enabling faster vision transformers via latency-aware soft token pruning

Z Kong, P Dong, X Ma, X Meng, W Niu, M Sun… - European conference on …, 2022 - Springer
Abstract Recently, Vision Transformer (ViT) has continuously established new milestones in
the computer vision field, while the high computation and memory cost makes its …

Quantumnas: Noise-adaptive search for robust quantum circuits

H Wang, Y Ding, J Gu, Y Lin, DZ Pan… - … Symposium on High …, 2022 - ieeexplore.ieee.org
Quantum noise is the key challenge in Noisy Intermediate-Scale Quantum (NISQ)
computers. Previous work for mitigating noise has primarily focused on gate-level or pulse …

Forms: Fine-grained polarized reram-based in-situ computation for mixed-signal dnn accelerator

G Yuan, P Behnam, Z Li, A Shafiee… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Recent work demonstrated the promise of using resistive random access memory (ReRAM)
as an emerging technology to perform inherently parallel analog domain in-situ matrix …

Towards sparsification of graph neural networks

H Peng, D Gurevin, S Huang, T Geng… - 2022 IEEE 40th …, 2022 - ieeexplore.ieee.org
As real-world graphs expand in size, larger GNN models with billions of parameters are
deployed. High parameter count in such models makes training and inference on graphs …

Architecting decentralization and customizability in dnn accelerators for hardware defect adaptation

E Ozen, A Orailoglu - … on Computer-Aided Design of Integrated …, 2022 - ieeexplore.ieee.org
The efficiency of machine intelligence techniques has improved noticeably in the embedded
application domains thanks to the dedicated hardware accelerators for deep neural …

Fault-tolerant spiking neural network mapping algorithm and architecture to 3D-NoC-based neuromorphic systems

WY Yerima, OM Ikechukwu, KN Dang… - IEEE Access, 2023 - ieeexplore.ieee.org
Neuromorphic computing uses spiking neuron network models to solve machine learning
problems in a more energy-efficient way when compared to conventional artificial neural …

A design methodology for fault-tolerant computing using astrocyte neural networks

M Isik, A Paul, ML Varshika, A Das - Proceedings of the 19th ACM …, 2022 - dl.acm.org
We propose a design methodology to facilitate fault tolerance of deep learning models. First,
we implement a many-core fault-tolerant neuromorphic hardware design, where neuron and …

You Already Have It: A Generator-Free Low-Precision DNN Training Framework Using Stochastic Rounding

G Yuan, SE Chang, Q Jin, A Lu, Y Li, Y Wu… - … on Computer Vision, 2022 - Springer
Stochastic rounding is a critical technique used in low-precision deep neural networks
(DNNs) training to ensure good model accuracy. However, it requires a large number of …

R-MaS3N: Robust Mapping of Spiking Neural Networks to 3D-NoC-Based Neuromorphic Systems for Enhanced Reliability

WY Yerima, KN Dang, AB Abdallah - IEEE Access, 2023 - ieeexplore.ieee.org
Neuromorphic computing utilizes spiking neural networks (SNNs) to offer power/energy-
efficient solutions for complex machine-learning problems in hardware. However, neural …

3D-FPIM: An extreme energy-efficient DNN acceleration system using 3D NAND flash-based in-situ PIM unit

H Lee, M Kim, D Min, J Kim, J Back… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
The crossbar structure of the nonvolatile memory enables highly parallel and energy-
efficient analog matrix-vector-multiply (MVM) operations. To exploit its efficiency, existing …