A survey of state-of-the-art on edge computing: Theoretical models, technologies, directions, and development paths

B Liu, Z Luo, H Chen, C Li - IEEE Access, 2022 - ieeexplore.ieee.org
In order to describe the roadmap of current edge computing research activities, we first
address a brief overview of the most advanced edge computing surveys published in the last …

SPACE4AI-R: a runtime management tool for AI applications component placement and resource scaling in computing continua

F Filippini, H Sedghani, D Ardagna - Proceedings of the IEEE/ACM 16th …, 2023 - dl.acm.org
The recent migration towards Internet of Things determined the rise of a Computing
Continuum paradigm where Edge and Cloud resources coordinate to support the execution …

Understanding the Benefits of Hardware-Accelerated Communication in Model-Serving Applications

WA Hanafy, L Wang, H Chang… - 2023 IEEE/ACM 31st …, 2023 - ieeexplore.ieee.org
It is commonly assumed that the end-to-end networking performance of edge offloading is
purely dictated by that of the network connectivity between end devices and edge computing …

D-STACK: High Throughput DNN Inference by Effective Multiplexing and Spatio-Temporal Scheduling of GPUs

A Dhakal, SG Kulkarni, KK Ramakrishnan - arXiv preprint arXiv …, 2023 - arxiv.org
Hardware accelerators such as GPUs are required for real-time, low-latency inference with
Deep Neural Networks (DNN). However, due to the inherent limits to the parallelism they …

CloudAIBus: a testbed for AI based cloud computing environments

S Velu, SS Gill, SS Murugesan, H Wu, X Li - Cluster Computing, 2024 - Springer
Smart resource allocation is essential for optimising cloud computing efficiency and
utilisation, but it is also very challenging as traditional approaches often overprovision CPU …

IoT video analytics for surveillance-based systems in smart cities

K Aminiyeganeh, RWL Coutinho… - Computer Communications, 2024 - Elsevier
Smart city applications are revolutionizing the way people interact with diverse systems in
city-wide applications. Internet of Things (IoT) and machine learning are two enabling …

Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in Autonomous Edge Systems

C Zhang, Y Deng, H Zhao, T Chen, S Deng - arXiv preprint arXiv …, 2023 - arxiv.org
In the realm of edge computing, the increasing demand for high Quality of Service (QoS),
particularly in dynamic multimedia streaming applications (eg, Augmented Reality/Virtual …

Empowering Sustainable Industrial and Service Systems through AI-Enhanced Cloud Resource Optimization

C Seo, D Yoo, Y Lee - Sustainability, 2024 - mdpi.com
This study focuses on examining the shift of an application system from a traditional
monolithic architecture to a cloud-native microservice architecture (MSA), with a specific …

Energy Time Fairness: Balancing Fair Allocation of Energy and Time for GPU Workloads

Q Liang, WA Hanafy, N Bashir, D Irwin… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
Traditionally, multi-tenant cloud and edge platforms use fair-share schedulers to fairly
multiplex resources across applications. These schedulers ensure applications receive …

Enhanced Scheduling of AI Applications in Multi-Tenant Cloud Using Genetic Optimizations

S Kwon, H Bahn - Applied Sciences, 2024 - mdpi.com
The artificial intelligence (AI) industry is increasingly integrating with diverse sectors such as
smart logistics, FinTech, entertainment, and cloud computing. This expansion has led to the …