We present Grappa, a modern take on software distributed shared memory (DSM) for in- memory data-intensive applications. Grappa enables users to program a cluster as if it were …
Recent improvements in both the performance and scalability of shared-nothing, transactional, in-memory NewSQL databases have reopened the research question of …
Six years ago, Google released an invaluable set of scheduler logs which has already been used in more than 450 publications. We find that the scarcity of other data sources, however …
K Ren, Q Zheng, S Patil… - SC'14: Proceedings of the …, 2014 - ieeexplore.ieee.org
The growing size of modern storage systems is expected to exceed billions of objects, making metadata scalability critical to overall performance. Many existing distributed file …
Pattern queries are widely used in complex event processing (CEP) systems. Existing pattern matching techniques, however, can provide only limited performance for expensive …
M Xia, M Saxena, M Blaum, DA Pease - 13th USENIX conference on file …, 2015 - usenix.org
Distributed storage systems are increasingly transitioning to the use of erasure codes since they offer higher reliability at significantly lower storage costs than data replication. However …
Data analytics has recently grown to include increasingly sophisticated techniques, such as machine learning and advanced statistics. Users frequently express these complex analytics …
X Chen, CD Lu, K Pattabiraman - 2014 IEEE 25th International …, 2014 - ieeexplore.ieee.org
In this paper, we analyze a workload trace from the Google cloud cluster and characterize the observed failures. The goal of our work is to improve the understanding of failures in …
With widespread advances in machine learning, a number of large enterprises are beginning to incorporate machine learning models across a number of products. These …