Apollo: Scalable and coordinated scheduling for {Cloud-Scale} computing

E Boutin, J Ekanayake, W Lin, B Shi, J Zhou… - … USENIX symposium on …, 2014 - usenix.org
Efficiently scheduling data-parallel computation jobs over cloud-scale computing clusters is
critical for job performance, system throughput, and resource utilization. It is becoming even …

Tales of the tail: Hardware, os, and application-level sources of tail latency

J Li, NK Sharma, DRK Ports, SD Gribble - Proceedings of the ACM …, 2014 - dl.acm.org
Interactive services often have large-scale parallel implementations. To deliver fast
responses, the median and tail latencies of a service's components must be low. In this …

Calendaring for wide area networks

S Kandula, I Menache, R Schwartz… - Proceedings of the 2014 …, 2014 - dl.acm.org
Datacenter WAN traffic consists of high priority transfers that have to be carried as soon as
they arrive alongside large transfers with pre-assigned deadlines on their completion …

Reservation-based scheduling: If you're late don't blame us!

C Curino, DE Difallah, C Douglas, S Krishnan… - Proceedings of the …, 2014 - dl.acm.org
The continuous shift towards data-driven approaches to business, and a growing attention to
improving return on investments (ROI) for cluster infrastructures is generating new …

Matrix: Achieving predictable virtual machine performance in the clouds

RC Chiang, J Hwang, HH Huang, T Wood - … International Conference on …, 2014 - usenix.org
The success of cloud computing builds largely upon on-demand supply of virtual machines
(VMs) that provide the abstraction of a physical machine on shared resources. Unfortunately …

Mammoth: Gearing hadoop towards memory-intensive mapreduce applications

X Shi, M Chen, L He, X Xie, L Lu, H Jin… - … on Parallel and …, 2014 - ieeexplore.ieee.org
The MapReduce platform has been widely used for large-scale data processing and
analysis recently. It works well if the hardware of a cluster is well configured. However, our …

Evaluating job packing in warehouse-scale computing

A Verma, M Korupolu, J Wilkes - 2014 IEEE International …, 2014 - ieeexplore.ieee.org
One of the key factors in selecting a good scheduling algorithm is using an appropriate
metric for comparing schedulers. But which metric should be used when evaluating …

Extending mapreduce across clouds with bstream

S Kailasam, P Dhawalia, SJ Balaji… - … on Cloud Computing, 2014 - ieeexplore.ieee.org
Today, batch processing frameworks like Hadoop MapReduce are difficult to scale to
multiple clouds due to latencies involved in inter-cloud data transfer and synchronization …

Deadline-aware scheduling of big-data processing jobs

P Bodík, I Menache, J Naor, J Yaniv - … of the 26th ACM symposium on …, 2014 - dl.acm.org
This paper presents a novel algorithm for scheduling big data jobs on large compute
clusters. In our model, each job is represented by a DAG consisting of several stages linked …

A novel real-time scheduling algorithm and performance analysis of a MapReduce-based cloud

F Teng, F Magoulès, L Yu, T Li - The Journal of Supercomputing, 2014 - Springer
MapReduce, a popular programming model for processing data-intensive tasks, has
achieved great success in a wide range of applications such as search indexing, social …