Energon: Toward efficient acceleration of transformers using dynamic sparse attention

Z Zhou, J Liu, Z Gu, G Sun - IEEE Transactions on Computer …, 2022 - ieeexplore.ieee.org
In recent years, transformer models have revolutionized natural language processing (NLP)
and shown promising performance on computer vision (CV) tasks. Despite their …

QuantMAC: Enhancing Hardware Performance in DNNs With Quantize Enabled Multiply-Accumulate Unit

N Ashar, G Raut, V Trivedi, SK Vishvakarma… - IEEE …, 2024 - ieeexplore.ieee.org
In response to the escalating demand for hardware-efficient Deep Neural Network (DNN)
architectures, we present a novel quantize-enabled multiply-accumulate (MAC) unit. Our …

SET Effects on Quasi Delay Insensitive and Synchronous Circuits

Z Tabassam, A Steininger - 2023 IEEE European Test …, 2023 - ieeexplore.ieee.org
Due to their unbounded data accepting windows asynchronous circuits seem to be more
susceptible to environmental effects than their synchronous counterparts with their strict data …

Advancing machine learning tasks with field-programmable gate arrays: advantages, applications, challenges, and future perspectives

D Ma - … International Conference on Electrical, Electronics, and …, 2024 - spiedigitallibrary.org
This article comprehensively explores the applications, advantages, and challenges of using
Field-Programmable Gate Arrays (FPGAs) to enhance machine learning tasks. It fills a gap …

Stream Processor Development using Multi-Threshold NULL Convention Logic Asynchronous Design Methodology

W Khalil - 2023 - search.proquest.com
Decreasing transistor feature size has led to an increase in the number of transistors in
integrated circuits (IC), allowing for the implementation of more complex logic. However …