Hawk: Hybrid datacenter scheduling

P Delgado, F Dinu, AM Kermarrec… - 2015 USENIX Annual …, 2015 - usenix.org
Hawk: Hybrid Datacenter Scheduling Page 1 This paper is included in the Proceedings of
the 2015 USENIX Annual Technical Conference (USENIC ATC ’15). July 8–10, 2015 • …

{Latency-Tolerant} software distributed shared memory

J Nelson, B Holt, B Myers, P Briggs, L Ceze… - 2015 USENIX Annual …, 2015 - usenix.org
We present Grappa, a modern take on software distributed shared memory (DSM) for in-
memory data-intensive applications. Grappa enables users to program a cluster as if it were …

{HopsFS}: Scaling hierarchical file system metadata using {NewSQL} databases

S Niazi, M Ismail, S Haridi, J Dowling… - … USENIX Conference on …, 2017 - usenix.org
Recent improvements in both the performance and scalability of shared-nothing,
transactional, in-memory NewSQL databases have reopened the research question of …

On the diversity of cluster workloads and its impact on research results

G Amvrosiadis, JW Park, GR Ganger… - 2018 USENIX Annual …, 2018 - usenix.org
Six years ago, Google released an invaluable set of scheduler logs which has already been
used in more than 450 publications. We find that the scarcity of other data sources, however …

IndexFS: Scaling file system metadata performance with stateless caching and bulk insertion

K Ren, Q Zheng, S Patil… - SC'14: Proceedings of the …, 2014 - ieeexplore.ieee.org
The growing size of modern storage systems is expected to exceed billions of objects,
making metadata scalability critical to overall performance. Many existing distributed file …

On complexity and optimization of expensive queries in complex event processing

H Zhang, Y Diao, N Immerman - Proceedings of the 2014 ACM SIGMOD …, 2014 - dl.acm.org
Pattern queries are widely used in complex event processing (CEP) systems. Existing
pattern matching techniques, however, can provide only limited performance for expensive …

A tale of two erasure codes in {HDFS}

M Xia, M Saxena, M Blaum, DA Pease - 13th USENIX conference on file …, 2015 - usenix.org
Distributed storage systems are increasingly transitioning to the use of erasure codes since
they offer higher reliability at significantly lower storage costs than data replication. However …

An architecture for compiling udf-centric workflows

A Crotty, A Galakatos, K Dursun, T Kraska… - Proceedings of the …, 2015 - dl.acm.org
Data analytics has recently grown to include increasingly sophisticated techniques, such as
machine learning and advanced statistics. Users frequently express these complex analytics …

Failure analysis of jobs in compute clouds: A google cluster case study

X Chen, CD Lu, K Pattabiraman - 2014 IEEE 25th International …, 2014 - ieeexplore.ieee.org
In this paper, we analyze a workload trace from the Google cloud cluster and characterize
the observed failures. The goal of our work is to improve the understanding of failures in …

[PDF][PDF] Multi-tenant GPU clusters for deep learning workloads: Analysis and implications

M Jeon, S Venkataraman, J Qian… - Technical report …, 2018 - microsoft.com
With widespread advances in machine learning, a number of large enterprises are
beginning to incorporate machine learning models across a number of products. These …