A survey of techniques for optimizing transformer inference

KT Chitty-Venkata, S Mittal, M Emani… - Journal of Systems …, 2023 - Elsevier
Recent years have seen a phenomenal rise in the performance and applications of
transformer neural networks. The family of transformer networks, including Bidirectional …

Lightweight Deep Learning for Resource-Constrained Environments: A Survey

HI Liu, M Galindo, H Xie, LK Wong, HH Shuai… - ACM Computing …, 2024 - dl.acm.org
Over the past decade, the dominance of deep learning has prevailed across various
domains of artificial intelligence, including natural language processing, computer vision …

Driver distraction detection using semi-supervised lightweight vision transformer

AAQ Mohammed, X Geng, J Wang, Z Ali - Engineering Applications of …, 2024 - Elsevier
The continuously increasing number of traffic accidents necessitates addressing distracted
driving, which is responsible for numerous fatalities. Enhancing driver behavior recognition …

Vision transformer computation and resilience for dynamic inference

K Sreedhar, J Clemons, R Venkatesan… - … Analysis of Systems …, 2024 - ieeexplore.ieee.org
State-of-the-art deep learning models for computer vision tasks are based on the transformer
architecture and often deployed in real-time applications. In this scenario, the resources …

Enabling and accelerating dynamic vision transformer inference for real-time applications

K Sreedhar, J Clemons, R Venkatesan… - arXiv preprint arXiv …, 2022 - arxiv.org
Many state-of-the-art deep learning models for computer vision tasks are based on the
transformer architecture. Such models can be computationally expensive and are typically …

Hardware accelerator for MobileViT vision transformer with reconfigurable computation

SF Hsiao, TH Chao, YC Yuan… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
With the great success of the Transformer model in Natural Language Processing (NLP),
Vision Transformer (ViT) was proposed achieving comparable performance to traditional …