Scavenger: A black-box batch workload resource manager for improving utilization in cloud environments

SA Javadi, A Suresh, M Wajahat, A Gandhi - Proceedings of the ACM …, 2019 - dl.acm.org
Resource under-utilization is common in cloud data centers. Prior works have proposed
improving utilization by running provider workloads in the background, colocated with tenant …

Understanding and optimizing workloads for unified resource management in large cloud platforms

C Lu, H Xu, K Ye, G Xu, L Zhang, G Yang… - Proceedings of the …, 2023 - dl.acm.org
To fully utilize computing resources, cloud providers such as Google and Alibaba choose to
co-locate online services with batch processing applications in their data centers. By …

DCloud: deadline-aware resource allocation for cloud computing jobs

D Li, C Chen, J Guan, Y Zhang… - IEEE transactions on …, 2015 - ieeexplore.ieee.org
With the tremendous growth of cloud computing, it is increasingly critical to provide
quantifiable performance to tenants and to improve resource utilization for the cloud …

{History-Based} harvesting of spare cycles and storage in {Large-Scale} datacenters

Y Zhang, G Prekas, GM Fumarola, M Fontoura… - … USENIX Symposium on …, 2016 - usenix.org
An effective way to increase utilization and reduce costs in datacenters is to co-locate their
latency-critical services and batch workloads. In this paper, we describe systems that harvest …

WorkloadCompactor: Reducing datacenter cost while providing tail latency SLO guarantees

T Zhu, MA Kozuch, M Harchol-Balter - … of the 2017 Symposium on Cloud …, 2017 - dl.acm.org
Service providers want to reduce datacenter costs by consolidating workloads onto fewer
servers. At the same time, customers have performance goals, such as meeting tail latency …

Who limits the resource efficiency of my datacenter: An analysis of alibaba datacenter traces

J Guo, Z Chang, S Wang, H Ding, Y Feng… - Proceedings of the …, 2019 - dl.acm.org
Cloud platform provides great flexibility and cost-efficiency for end-users and cloud
operators. However, low resource utilization in modern datacenters brings huge wastes of …

Parties: Qos-aware resource partitioning for multiple interactive services

S Chen, C Delimitrou, JF Martínez - Proceedings of the Twenty-Fourth …, 2019 - dl.acm.org
Multi-tenancy in modern datacenters is currently limited to a single latency-critical,
interactive service, running alongside one or more low-priority, best-effort jobs. This limits …

Rhythm: component-distinguishable workload deployment in datacenters

L Zhao, Y Yang, K Zhang, X Zhou, T Qiu, K Li… - Proceedings of the …, 2020 - dl.acm.org
Cloud service providers improve resource utilization by co-locating latency-critical (LC)
workloads with best-effort batch (BE) jobs in datacenters. However, they usually treat an LC …

Improving cloud infrastructure utilization through overbooking

L Tomás, J Tordsson - Proceedings of the 2013 ACM Cloud and …, 2013 - dl.acm.org
Despite the potential given by the combination of multi-tenancy and virtualization, resource
utilization in today's data centers is still low. We identify three key characteristics of cloud …

Autonomic mix-aware provisioning for non-stationary data center workloads

R Singh, U Sharma, E Cecchet, P Shenoy - Proceedings of the 7th …, 2010 - dl.acm.org
Online Internet applications see dynamic workloads that fluctuate over multiple time scales.
This paper argues that the non-stationarity in Internet application workloads, which causes …