Decentralized low-latency collaborative inference via ensembles on the edge

M Malka, E Farhan, H Morgenstern… - arXiv preprint arXiv …, 2022 - arxiv.org
The success of deep neural networks (DNNs) is heavily dependent on computational
resources. While DNNs are often employed on cloud servers, there is a growing need to …

Accelerating deep learning inference via model parallelism and partial computation offloading

H Zhou, M Li, N Wang, G Min… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid development of Internet-of-Things (IoT) and the explosive advance of deep
learning, there is an urgent need to enable deep learning inference on IoT devices in Mobile …

A unified federated DNNs framework for heterogeneous mobile devices

X Li, Y Li, S Li, Y Zhou, C Chen… - IEEE Internet of Things …, 2021 - ieeexplore.ieee.org
Mobile devices can generate a tremendous amount of unique data, and thus, create
countless opportunities for deep learning tasks. Due to the concerns of data privacy, it is …

Multi-agent collaborative inference via dnn decoupling: Intermediate feature compression and edge learning

Z Hao, G Xu, Y Luo, H Hu, J An… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Recently, deploying deep neural network (DNN) models via collaborative inference, which
splits a pre-trained model into two parts and executes them on user equipment (UE) and …

Improving device-edge cooperative inference of deep learning via 2-step pruning

W Shi, Y Hou, S Zhou, Z Niu, Y Zhang… - IEEE INFOCOM 2019 …, 2019 - ieeexplore.ieee.org
Deep neural networks (DNNs) are state-of-the-art solutions for many machine learning
applications, and have been widely used on mobile devices. Running DNNs on …

An adaptive DNN inference acceleration framework with end–edge–cloud collaborative computing

G Liu, F Dai, X Xu, X Fu, W Dou, N Kumar… - Future Generation …, 2023 - Elsevier
Abstract Deep Neural Networks (DNNs) based on intelligent applications have been
intensively deployed on mobile devices. Unfortunately, resource-constrained mobile devices …

DECC: Delay-Aware Edge-Cloud Collaboration for Accelerating DNN Inference

Z Zhuang, J Chen, W Xu, Q Qi, S Guo… - … on Emerging Topics …, 2024 - ieeexplore.ieee.org
Deep neural network (DNN)-enabled edge intelligence has been widely adopted to support
a variety of smart applications because of its ability to preserve privacy and conserve …

Enable pipeline processing of DNN co-inference tasks in the mobile-edge cloud

S Hu, C Dong, W Wen - 2021 IEEE 6th International …, 2021 - ieeexplore.ieee.org
Deep Neural Network (DNN) based artificial intelligence help driving the great development
of mobile Internet. However, the hardware of a mobile device may not be sufficiently to meet …

MJOA-MU: End-to-edge collaborative computation for DNN inference based on model uploading

H Yang, S Sun, M Liu, Q Zhang, Y Wang - Computer Networks, 2023 - Elsevier
As an emerging computing paradigm, edge computing can assist user equipments (UEs) in
executing computation-intensive deep neural network (DNN) inference tasks, thereby …

Coedge: Cooperative dnn inference with adaptive workload partitioning over heterogeneous edge devices

L Zeng, X Chen, Z Zhou, L Yang… - IEEE/ACM Transactions …, 2020 - ieeexplore.ieee.org
Recent advances in artificial intelligence have driven increasing intelligent applications at
the network edge, such as smart home, smart factory, and smart city. To deploy …