Integrating system state into spatio temporal graph neural network for microservice workload prediction

Y Luo, M Gao, Z Yu, H Ge, X Gao, T Cai… - Proceedings of the 30th …, 2024 - dl.acm.org
Microservice architecture has become a driving force in enhancing the modularity and
scalability of web applications, as evidenced by the Alipay platform's operational success …

A deep reinforcement learning approach towards distributed Function as a Service (FaaS) based edge application orchestration in cloud-edge continuum

ME Khansari, S Sharifian - Journal of Network and Computer Applications, 2025 - Elsevier
Serverless computing has emerged as a new cloud computing model which in contrast to
IoT offers unlimited and scalable access to resources. This paradigm improves resource …

ProKube: Proactive kubernetes orchestrator for inference in heterogeneous edge computing

B Ali, M Golec, S Singh Gill… - International Journal of …, 2025 - Wiley Online Library
Deep neural network (DNN) and machine learning (ML) models/inferences produce highly
accurate results demanding enormous computational resources. The limited capacity of end …

Gma: graph multi-agent microservice autoscaling algorithm in edge-cloud environment

G Tong, C Meng, S Song, M Pan… - 2023 IEEE international …, 2023 - ieeexplore.ieee.org
The emerging edge-cloud computing paradigm, comprising cloud centers and multiple
distributed edge servers, extends the computing capability from the cloud center to a range …

Humas: A heterogeneity-and upgrade-aware microservice auto-scaling framework in large-scale data centers

Q Hua, D Yang, S Qian, J Cao… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
An effective auto-scaling framework is essential for microservices to ensure performance
stability and resource efficiency under dynamic workloads. As revealed by many prior …

Towards Minimum Latency in Cloud-Native Applications via Service-Characteristic-Aware Microservice Deployment

R Xie, L Wang, C Song - 2024 IEEE International Conference …, 2024 - ieeexplore.ieee.org
Microservice applications are gaining popularity as cloud-native embraced by the IT
industry. However, they suffer from a latency problem because they intrinsically involve …

OSCA: Online user-managed server selection and configuration adaptation for interactive mar

X Shi, S Zhang, K Cheng, Y Chen… - 2023 IEEE/ACM 31st …, 2023 - ieeexplore.ieee.org
Interactive mobile augmented reality (MAR) applications such as Connected Lens are
becoming popular, which often rely on deep neural network (NN)-based video analytics …

MicroIRC: Instance-level Root Cause Localization for Microservice Systems

Y Zhu, J Wang, B Li, Y Zhao, Z Zhang, Y Xiong… - Journal of Systems and …, 2024 - Elsevier
The use of microservice architecture is gaining popularity in the development of web
applications. However, identifying the root cause of a failure can be challenging due to the …

TP-MDU: A Two-Phase Microservice Deployment Based on Minimal Deployment Unit in Edge Computing Environment

B Tang, Z Wu, W Xu, B Cao, M Tang… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In mobile edge computing (MEC) environment, effective microservices deployment
significantly reduces vendor costs and minimizes application latency. However, existing …

GeoScale: Microservice Autoscaling With Cost Budget in Geo-Distributed Edge Clouds

K Cheng, S Zhang, M Liu, Y Gu, L Wei… - … on Parallel and …, 2024 - ieeexplore.ieee.org
Deploying microservice instances in geo-distributed edge clouds which are located at the
network edge and in proximity to end-users can provide on-site processing, thereby …