PD Marinescu, G Candea - ACM Transactions on Computer Systems …, 2011 - dl.acm.org
A critical part of developing a reliable software system is testing its recovery code. This code is traditionally difficult to test in the lab, and, in the field, it rarely gets to run; yet, when it does …
J Lu, F Li, C Liu, L Li, X Feng… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Cloud systems suffer from distributed concurrency bugs, which often lead to data loss and service outage. This paper presents CloudRaid, a new automatical tool for finding …
Context Large-scale distributed systems are becoming commonplace with the large popularity of peer-to-peer and cloud computing. The increasing importance of these systems …
G Xu, S Lin, G Wang, X Liu, K Shi… - 2012 IEEE 31st …, 2012 - ieeexplore.ieee.org
Heterogeneity is the natural feature in distributed networks. Different from the traditional disk array, the amount of data allocated on heterogenous peers may be not the same. To …
R Banabic, G Candea… - 2011 IEEE/IFIP 41st …, 2011 - ieeexplore.ieee.org
In this paper we present a technique for automatically assessing the amount of damage a small number of participant nodes can inflict on the overall performance of a large …
R Nachiappan - 2020 - researchdirect.westernsydney.edu …
Cloud service providers are consistently striving to provide efficient and reliable service, to their client's Big Data storage need. Replication is a simple and flexible method to ensure …
S Sondur, K Gross, K Kant - 2020 20th IEEE/ACM International …, 2020 - ieeexplore.ieee.org
This paper explores the coupling between power/thermal aspects of data center and the vibrations caused by chassis/server fans in environments dominated by mechanical disks …