Performance bug analysis and detection for distributed storage and computing systems

J Li, Y Zhang, S Lu, HS Gunawi, X Gu… - ACM Transactions on …, 2023 - dl.acm.org
This article systematically studies 99 distributed performance bugs from five widely deployed
distributed storage and computing systems (Cassandra, HBase, HDFS, Hadoop …

Pcatch: Automatically detecting performance cascading bugs in cloud systems

J Li, Y Chen, H Liu, S Lu, Y Zhang, HS Gunawi… - Proceedings of the …, 2018 - dl.acm.org
Distributed systems have become the backbone of modern clouds. Users often expect high
scalability and performance isolation from distributed systems. Unfortunately, a type of poor …

Evaluating scalability bottlenecks by workload extrapolation

R Shi, Y Gan, Y Wang - 2018 IEEE 26th international …, 2018 - ieeexplore.ieee.org
Testing a scalability bottleneck requires a large system to generate sufficient load, which is
usually not accessible to researchers. To address this problem, this paper extrapolates the …

Sliding {Look-Back} Window Assisted Data Chunk Rewriting for Improving Deduplication Restore Performance

Z Cao, S Liu, F Wu, G Wang, B Li, DHC Du - 17th USENIX Conference …, 2019 - usenix.org
Data deduplication is an effective way of improving storage space utilization. The data
generated by deduplication is persistently stored in data chunks or data containers (a …

Pascal: An architecture for proactive auto-scaling of distributed services

F Lombardi, A Muti, L Aniello, R Baldoni… - Future Generation …, 2019 - Elsevier
One of the main characteristics that today makes cloud services so popular is their ability to
be elastic, ie, they can adapt their provisioning to variable workloads, thus increasing …

Sammple: Detecting semantic indoor activities in practical settings using locomotive signatures

Z Yan, D Chakraborty, A Misra… - 2012 16th …, 2012 - ieeexplore.ieee.org
We analyze the ability of mobile phone-generated accelerometer data to detect high-level
(ie, at the semantic level) indoor lifestyle activities, such as cooking at home and working at …

{ScaleCheck}: A {Single-Machine} Approach for Discovering Scalability Bugs in Large Distributed Systems

CA Stuardo, T Leesatapornwongsa… - … USENIX Conference on …, 2019 - usenix.org
We present ScaleCheck, an approach for discovering scalability bugs (a new class of bug in
large storage systems) and for democratizing large-scale testing. ScaleCheck employs a …

∅ sim: Preparing System Software for a World with Terabyte-scale Memories

M Mansi, MM Swift - Proceedings of the Twenty-Fifth International …, 2020 - dl.acm.org
Recent advances in memory technologies mean that commodity machines may soon have
terabytes of memory; however, such machines remain expensive and uncommon today …

Scalability bugs: When 100-node testing is not enough

T Leesatapornwongsa, CA Stuardo… - Proceedings of the 16th …, 2017 - dl.acm.org
We highlight the problem of scalability bugs, a new class of bugs that appear in" cloud-
scale" distributed systems. Scalability bugs are latent bugs that are cluster-scale dependent …

Understanding issue correlations: a case study of the hadoop system

J Huang, X Zhang, K Schwan - … of the Sixth ACM Symposium on Cloud …, 2015 - dl.acm.org
Over the last decade, Hadoop has evolved into a widely used platform for Big Data
applications. Acknowledging its wide-spread use, we present a comprehensive analysis of …