FLASH: Fast model adaptation in ML-centric cloud platforms

H Qiu, W Mao, A Patke, S Cui, C Wang… - Proceedings of …, 2024 - proceedings.mlsys.org
The emergence of ML in various cloud system management tasks (e.g., workload autoscaling
and job scheduling) has become a core driver of ML-centric cloud platforms. However, there …

Reinforcement learning for resource management in multi-tenant serverless platforms

H Qiu, W Mao, A Patke, C Wang, H Franke… - Proceedings of the 2nd …, 2022 - dl.acm.org
Serverless Function-as-a-Service (FaaS) is an emerging cloud computing paradigm that
frees application developers from infrastructure management tasks such as resource …

SIMPPO: A scalable and incremental online learning framework for serverless resource management

H Qiu, W Mao, A Patke, C Wang, H Franke… - Proceedings of the 13th …, 2022 - dl.acm.org
Serverless Function-as-a-Service (FaaS) offers improved programmability for customers, yet
it is not server-"less" and comes at the cost of more complex infrastructure management (e.g., …

Prebaking runtime environments to improve the FaaS cold start latency

D Fireman, P Silva, TE Pereira, L Mafra… - Future Generation …, 2024 - Elsevier
Function-as-service (FaaS) platforms promise a simpler programming model for
cloud computing, given that providers take care of the overall resource management while …

On the promise and challenges of foundation models for learning-based cloud systems management

H Qiu, W Mao, C Wang, H Franke, ZT Kalbarczyk… - … on Machine Learning …, 2023 - haoran-qiu.com
Foundation models (FMs) are machine learning models that are trained broadly on large-
scale data and can be adapted to a set of downstream tasks via fine-tuning, few-shot …

PARM: Adaptive resource allocation for datacenter power capping

H Qiu, L Zhang, C Wang, H Franke… - Machine Learning for …, 2023 - haoran-qiu.com
Energy efficiency is pressing in today's cloud datacenters. Various power management
strategies, such as oversubscription, power capping, and dynamic voltage and frequency …

Enhanced Runtime-Adaptable Routing for Serverless Functions based on Performance and Cost Tradeoffs in Hybrid Cloud Settings

G Fatouros, G Kousiouris, G Makridis… - … on Cloud Computing …, 2023 - ieeexplore.ieee.org
Serverless computing has reshaped the cloud computing landscape by offering benefits
such as auto-scalability, streamlined operational management, and granular billing. As its …

Accelerating automation of digital health applications via cloud native approach

B Wen, Y Koyfman, H Tian, B Lublinsky… - Proceedings of the …, 2022 - dl.acm.org
Writing thread-safe code for concurrent processing requires experience and training, thus
legacy research code is usually single-threaded, which poses a challenge when it comes to …

Optimizing simultaneous autoscaling for serverless cloud computing

H Ship, E Shindin, C Wang, D Arroyo… - arXiv preprint arXiv …, 2023 - arxiv.org
This paper explores resource allocation in serverless cloud computing platforms and
proposes an optimization approach for autoscaling systems. Serverless computing relieves …

Mitigating Serverless Tail Latency: A Comprehensive Study of Factors and Strategies

A Kumari, RK Behera, B Sahoo - 2023 OITS International …, 2023 - ieeexplore.ieee.org
Serverless computing has gained widespread popularity due to its ability to dynamically
allocate resources and abstract away the underlying infrastructure, allowing developers to …