Morpheus: Towards automated {SLOs} for enterprise clusters

SA Jyothi, C Curino, I Menache… - … USENIX symposium on …, 2016 - usenix.org
Modern resource management frameworks for largescale analytics leave unresolved the
problematic tension between high cluster utilization and job's performance predictability …

The power of choice in {Data-Aware} cluster scheduling

S Venkataraman, A Panda… - … USENIX Symposium on …, 2014 - usenix.org
Providing timely results in the face of rapid growth in data volumes has become important for
analytical frameworks. For this reason, frameworks increasingly operate on only a subset of …

Quasar: Resource-efficient and qos-aware cluster management

C Delimitrou, C Kozyrakis - ACM Sigplan Notices, 2014 - dl.acm.org
Cloud computing promises flexibility and high performance for users and high cost-efficiency
for operators. Nevertheless, most cloud facilities operate at very low utilization, hurting both …

3sigma: distribution-based cluster scheduling for runtime uncertainty

JW Park, A Tumanov, A Jiang, MA Kozuch… - Proceedings of the …, 2018 - dl.acm.org
The 3Sigma cluster scheduling system uses job runtime histories in a new way. Knowing
how long each job will execute enables a scheduler to more effectively pack jobs with …

Tarcil: Reconciling scheduling speed and quality in large shared clusters

C Delimitrou, D Sanchez, C Kozyrakis - … of the Sixth ACM Symposium on …, 2015 - dl.acm.org
Scheduling diverse applications in large, shared clusters is particularly challenging. Recent
research on cluster scheduling focuses either on scheduling speed, using sampling to …

Efficient queue management for cluster scheduling

J Rasley, K Karanasos, S Kandula, R Fonseca… - Proceedings of the …, 2016 - dl.acm.org
Job scheduling in Big Data clusters is crucial both for cluster operators' return on investment
and for overall user experience. In this context, we observe several anomalies in how …

Reservation-based scheduling: If you're late don't blame us!

C Curino, DE Difallah, C Douglas, S Krishnan… - Proceedings of the …, 2014 - dl.acm.org
The continuous shift towards data-driven approaches to business, and a growing attention to
improving return on investments (ROI) for cluster infrastructures is generating new …

Sinan: ML-based and QoS-aware resource management for cloud microservices

Y Zhang, W Hua, Z Zhou, GE Suh… - Proceedings of the 26th …, 2021 - dl.acm.org
Cloud applications are increasingly shifting from large monolithic services, to large numbers
of loosely-coupled, specialized microservices. Despite their advantages in terms of …

Providing {SLOs} for {Resource-Harvesting}{VMs} in cloud platforms

P Ambati, Í Goiri, F Frujeri, A Gun, K Wang… - … USENIX Symposium on …, 2020 - usenix.org
Cloud providers rent the resources they do not allocate as evictable virtual machines (VMs),
like spot instances. In this paper, we first characterize the unallocated resources in Microsoft …

Altruistic scheduling in {Multi-Resource} clusters

R Grandl, M Chowdhury, A Akella… - … USENIX symposium on …, 2016 - usenix.org
Given the well-known tradeoffs between fairness, performance, and efficiency, modern
cluster schedulers often prefer instantaneous fairness as their primary objective to ensure …