Omega: flexible, scalable schedulers for large compute clusters

M Schwarzkopf, A Konwinski, M Abd-El-Malek… - Proceedings of the 8th …, 2013 - dl.acm.org
Increasing scale and the need for rapid response to changing requirements are hard to meet
with current monolithic cluster scheduler architectures. This restricts the rate at which new …

Speeding up distributed request-response workflows

V Jalaparti, P Bodik, S Kandula, I Menache… - ACM SIGCOMM …, 2013 - dl.acm.org
We found that interactive services at Bing have highly variable datacenter-side processing
latencies because their processing consists of many sequential stages, parallelization …

Efficient online scheduling for deadline-sensitive jobs

B Lucier, I Menache, J Naor, J Yaniv - … of the twenty-fifth annual ACM …, 2013 - dl.acm.org
We consider mechanisms for online deadline-aware scheduling in large computing clusters.
Batch jobs that run on such clusters often require guarantees on their completion time (ie …

Natjam: Design and evaluation of eviction policies for supporting priorities and deadlines in mapreduce clusters

B Cho, M Rahman, T Chajed, I Gupta, C Abad… - Proceedings of the 4th …, 2013 - dl.acm.org
This paper presents Natjam, a system that supports arbitrary job priorities, hard real-time
scheduling, and efficient preemption for Mapreduce clusters that are resource-constrained …

Fault Management in {Map-Reduce} Through Early Detection of Anomalous Nodes

S Kadirvel, J Ho, JAB Fortes - 10th international conference on …, 2013 - usenix.org
Map-Reduce frameworks such as Hadoop have built-in fault-tolerance mechanisms that
allow jobs to run to completion even in the presence of certain faults. However, these jobs …

Accelerating batch analytics with residual resources from interactive clouds

RB Clay, Z Shen, X Ma - 2013 IEEE 21st International …, 2013 - ieeexplore.ieee.org
The popularity of cloud-based interactive computing services (eg, virtual desktops) brings
new management challenges. Each interactive user leaves abundant but fluctuating …

Real-time scheduling in mapreduce clusters

C He, Y Lu, D Swanson - 2013 IEEE 10th International …, 2013 - ieeexplore.ieee.org
MapReduce has been widely used as a Big Data processing platform. As it gets popular, its
scheduling becomes increasingly important. In particular, since many MapReduce …

[PDF][PDF] Exploiting time-malleability in cloud-based batch processing systems

L Mai, E Kalyvianaki, P Costa - 2013 - openaccess.city.ac.uk
Existing cloud provisioning schemes allocate resources to batch processing systems at
deployment time and only change this allocation at run-time due to unexpected events such …

Scheduling real-time workflow on MapReduce-based cloud

F Teng, H Yang, T Li, Y Yang… - … Conference on Innovative …, 2013 - ieeexplore.ieee.org
As a popular programming model in cloud-based data processing environment, MapReduce
and its open source implementation Hadoop, are widely applied both in industry and …

[PDF][PDF] Tetrisched: Space-time scheduling for heterogeneous datacenters

A Tumanov, T Zhu, MA Kozuch, M Harchol-Balter… - Tech. Rep., 2013 - pdl.cmu.edu
Tetrisched is a new scheduler that explicitly considers both job-specific preferences and
estimated job runtimes in its allocation of resources. Combined, this information allows …