A survey on resource management in joint communication and computing-embedded SAGIN

Q Chen, Z Guo, W Meng, S Han, C Li… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
The advent of the 6G era aims for ubiquitous connectivity, with the integration of non-
terrestrial networks (NTN) offering extensive coverage and enhanced capacity. As …

Machine learning-based computation offloading in multi-access edge computing: A survey

A Choudhury, M Ghose, A Islam - Journal of Systems Architecture, 2024 - Elsevier
The advancement of technology towards the realization of the evolving mobile computing
paradigm brings a rapid paradigm shift in its usage, especially in the Internet, computation …

Accelerating deep learning inference via model parallelism and partial computation offloading

H Zhou, M Li, N Wang, G Min… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
With the rapid development of Internet-of-Things (IoT) and the explosive advance of deep
learning, there is an urgent need to enable deep learning inference on IoT devices in Mobile …

Joint optimization of DNN partition and scheduling for mobile cloud computing

Y Duan, J Wu - Proceedings of the 50th International Conference on …, 2021 - dl.acm.org
Reducing the inference time of Deep Neural Networks (DNNs) is critical when running time
sensitive applications on mobile devices. Existing research has shown that partitioning a …

Partition placement and resource allocation for multiple DNN-based applications in heterogeneous IoT environments

T Kim, H Park, Y Jin, SS Lee… - IEEE Internet of Things …, 2023 - ieeexplore.ieee.org
The evolution of the Internet of Things (IoT) has been driving the explosive growth of deep
neural network (DNN)-based applications and processing demands. Hence, edge …

Optimizing job offloading schedule for collaborative DNN inference

Y Duan, J Wu - IEEE Transactions on Mobile Computing, 2023 - ieeexplore.ieee.org
Deep Neural Networks (DNNs) have been widely deployed in mobile applications. DNN
inference latency is a critical metric to measure the service quality of those applications …

Cloud-assisted collaborative inference of convolutional neural networks for vision tasks on resource-constrained devices

I Rodriguez-Conde, C Campos, F Fdez-Riverola - Neurocomputing, 2023 - Elsevier
This work analyses the most relevant research conducted under the mobile cloud computing
paradigm to bring vision tasks supported by state-of-the-art deep convolutional neural …

A flexible algorithm to offload DAG applications for edge computing

GFC de Queiroz, JF de Rezende… - Journal of Network and …, 2024 - Elsevier
Abstract Multi-access Edge Computing (MEC) is an enabling technology to leverage new
network applications, such as virtual/augmented reality, by providing faster task processing …

Multi-source dnn task offloading strategy based on in-network computing

L Hu, Y Chai, Q Li, W Li, Y Zhang - 2023 25th International …, 2023 - ieeexplore.ieee.org
As applications grow in scale, the centralized computing approach leads to excessive
bandwidth requirements and high computational latencies. The traditional computing model …

Fused-Layer-based DNN Model Parallelism and Partial Computation Offloading

M Li, N Wang, H Zhou, Y Duan… - GLOBECOM 2022-2022 …, 2022 - ieeexplore.ieee.org
With the development of Internet of Things (IoT) and the advance of deep learning, there is
an urgent need to enable deep learning inference on IoT devices. To address the …