Pond: CXL-based memory pooling systems for cloud platforms

H Li, DS Berger, L Hsu, D Ernst, P Zardoshti… - Proceedings of the 28th …, 2023 - dl.acm.org
Public cloud providers seek to meet stringent performance requirements and low hardware
cost. A key driver of performance and cost is main memory. Memory pooling promises to …

FIRM: An intelligent fine-grained resource management framework for SLO-oriented microservices

H Qiu, SS Banerjee, S Jha, ZT Kalbarczyk… - 14th USENIX symposium …, 2020 - usenix.org
User-facing latency-sensitive web services include numerous distributed,
intercommunicating microservices that promise to simplify software development and …

An open-source benchmark suite for microservices and their hardware-software implications for cloud & edge systems

Y Gan, Y Zhang, D Cheng, A Shetty, P Rathi… - Proceedings of the …, 2019 - dl.acm.org
Cloud services have recently started undergoing a major shift from monolithic applications,
to graphs of hundreds or thousands of loosely-coupled microservices. Microservices …

Sinan: ML-based and QoS-aware resource management for cloud microservices

Y Zhang, W Hua, Z Zhou, GE Suh… - Proceedings of the 26th …, 2021 - dl.acm.org
Cloud applications are increasingly shifting from large monolithic services, to large numbers
of loosely-coupled, specialized microservices. Despite their advantages in terms of …

Who limits the resource efficiency of my datacenter: An analysis of Alibaba datacenter traces

J Guo, Z Chang, S Wang, H Ding, Y Feng… - Proceedings of the …, 2019 - dl.acm.org
Cloud platform provides great flexibility and cost-efficiency for end-users and cloud
operators. However, low resource utilization in modern datacenters brings huge wastes of …

PARTIES: QoS-aware resource partitioning for multiple interactive services

S Chen, C Delimitrou, JF Martínez - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …

Cluster resource scheduling in cloud computing: literature review and research challenges

W Khallouli, J Huang - The Journal of Supercomputing, 2022 - Springer
Scheduling plays a pivotal role in cloud computing systems. Designing an efficient
scheduler is a challenging task. The challenge comes from several aspects, including the …

Seer: Leveraging big data to navigate the complexity of performance debugging in cloud microservices

Y Gan, Y Zhang, K Hu, D Cheng, Y He… - Proceedings of the …, 2019 - dl.acm.org
Performance unpredictability is a major roadblock towards cloud adoption, and has
performance, cost, and revenue ramifications. Predictable performance is even more critical …

Protean: VM allocation service at scale

O Hadary, L Marshall, I Menache, A Pan… - … USENIX Symposium on …, 2020 - usenix.org
We describe the design and implementation of Protean--the Microsoft Azure service
responsible for allocating Virtual Machines (VMs) to millions of servers around the globe. A …

Sage: practical and scalable ML-driven performance debugging in microservices

Y Gan, M Liang, S Dev, D Lo, C Delimitrou - Proceedings of the 26th …, 2021 - dl.acm.org
Cloud applications are increasingly shifting from large monolithic services to complex
graphs of loosely-coupled microservices. Despite the advantages of modularity and …