Cloud storage reliability for big data applications: A state of the art survey

R Nachiappan, B Javadi, RN Calheiros… - Journal of Network and …, 2017 - Elsevier
Cloud storage systems are now mature enough to handle a massive volume of
heterogeneous and rapidly changing data, which is known as Big Data. However, failures …

Efficient testing of recovery code using fault injection

PD Marinescu, G Candea - ACM Transactions on Computer Systems …, 2011 - dl.acm.org
A critical part of developing a reliable software system is testing its recovery code. This code
is traditionally difficult to test in the lab, and, in the field, it rarely gets to run; yet, when it does …

CloudRaid: Detecting distributed concurrency bugs via log mining and enhancement

J Lu, F Li, C Liu, L Li, X Feng… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Cloud systems suffer from distributed concurrency bugs, which often lead to data loss and
service outage. This paper presents CloudRaid, a new automatical tool for finding …

Model-based testing of global properties on large-scale distributed systems

G Sunyé, EC De Almeida, Y Le Traon, B Baudry… - Information and …, 2014 - Elsevier
Context Large-scale distributed systems are becoming commonplace with the large
popularity of peer-to-peer and cloud computing. The increasing importance of these systems …

Hero: Heterogeneity-aware erasure coded redundancy optimal allocation for reliable storage in distributed networks

G Xu, S Lin, G Wang, X Liu, K Shi… - 2012 IEEE 31st …, 2012 - ieeexplore.ieee.org
Heterogeneity is the natural feature in distributed networks. Different from the traditional disk
array, the amount of data allocated on heterogenous peers may be not the same. To …

Automated vulnerability discovery in distributed systems

R Banabic, G Candea… - 2011 IEEE/IFIP 41st …, 2011 - ieeexplore.ieee.org
In this paper we present a technique for automatically assessing the amount of damage a
small number of participant nodes can inflict on the overall performance of a large …

Efficient data reliability management of cloud storage systems for big data applications

R Nachiappan - 2020 - researchdirect.westernsydney.edu …
Cloud service providers are consistently striving to provide efficient and reliable service, to
their client's Big Data storage need. Replication is a simple and flexible method to ensure …

Thermo-Mechanical Coupling Induced Performance Degradation in Storage Systems

S Sondur, K Gross, K Kant - 2020 20th IEEE/ACM International …, 2020 - ieeexplore.ieee.org
This paper explores the coupling between power/thermal aspects of data center and the
vibrations caused by chassis/server fans in environments dominated by mechanical disks …

[引用][C] Uma abordagem para o teste de dependabilidade de sistemas MapReduce com base em casos de falha representativos

[引用][C] Model-Based Testing of Large-Scale Distributed Routing Tables