Multi-exit DNN inference acceleration for intelligent terminal with heterogeneous processors

J Zhang, W Xin, D Lv, J Wang, G Cai, F Dong - … Computing: Informatics and …, 2023 - Elsevier
Recently, there has been a burgeoning popularity in the deployment of deep learning vision
applications upon terminal devices. However, as the number of layers in deep neural …

An Automatic Pipeline Parallel Acceleration Framework for Neural Network Models on Heterogeneous Computing Platforms

J Yang, X Long, Z Ma, C Yang - 2022 5th International …, 2022 - ieeexplore.ieee.org
With the widespread use of heterogeneous computing platforms in the AI field, neural
network models on heterogeneous computing platforms are experiencing problems such as …

Learning-Based Edge-Device Collaborative DNN Inference in IoVT Networks

X Xu, K Yan, S Han, B Wang, X Tao… - IEEE Internet of Things …, 2023 - ieeexplore.ieee.org
Deep Neural Network (DNN) is a promising technology for Internet of Visual Things (IoVT)
devices to extract their visual information from unstructured data. However, it is hard to …

Mema: Fast inference of multiple deep models

J Galjaard, B Cox, A Ghiassi, LY Chen… - … Workshops and other …, 2021 - ieeexplore.ieee.org
The execution of deep neural network (DNN) inference jobs on edge devices has become
increasingly popular. Multiple of such inference models can concurrently analyse the on …

Characterizing the I/O pipeline in the deployment of CNNs on commercial accelerators

J Li, Z Jiang, F Liu, X Dong, G Li… - 2020 IEEE Intl Conf …, 2020 - ieeexplore.ieee.org
Commercial AI accelerators are gaining popularity because of their high energy efficiency
for the inference of deep neural networks (DNNs). How to benchmark them remains hot …

Processor pipelining method for efficient deep neural network inference on embedded devices

A Parashar, A Abraham, D Chaudhary… - 2020 IEEE 27th …, 2020 - ieeexplore.ieee.org
Myriad applications of Deep Neural Networks (DNN) and the race for better accuracy have
paved the way for the development of more computationally intensive network architectures …

EFFECT-DNN: Energy-efficient Edge Framework for Real-time DNN Inference

X Zhang, M Mounesan, S Debroy - 2023 IEEE 24th …, 2023 - ieeexplore.ieee.org
Real-time visual computing applications running Deep Neural Networks (DNN) are
becoming popular for mission-critical use cases such as, disaster response, tactical …

Dynamic path based DNN synergistic inference acceleration in edge computing environment

M Zhou, B Zhou, H Wang, F Dong… - 2021 IEEE 27th …, 2021 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have achieved excellent performance in intelligent
applications. Nevertheless, it is elusive for devices with limited resources to support …

Transformer inference acceleration in edge computing environment

M Li, W Zhang, D Xia - 2023 IEEE/ACM 23rd International …, 2023 - ieeexplore.ieee.org
The rapid development of deep neural networks (DNNs) has provided a strong foundation
for the popularization of intelligent applications. However, the limited computing power of IoT …

Masa: Responsive multi-dnn inference on the edge

B Cox, J Galjaard, A Ghiassi, R Birke… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) are becoming the core components of many applications
running on edge devices, especially for real time image-based analysis. Increasingly, multi …