Towards automatically checking thousands of failures with micro-specifications

R Nachiappan, B Javadi, RN Calheiros… - Journal of Network and …, 2017 - Elsevier

Cloud storage systems are now mature enough to handle a massive volume of
heterogeneous and rapidly changing data, which is known as Big Data. However, failures …

Cited by 132 - Related articles - All 5 versions

[PDF] epfl.ch

Efficient testing of recovery code using fault injection

PD Marinescu, G Candea - ACM Transactions on Computer Systems …, 2011 - dl.acm.org

A critical part of developing a reliable software system is testing its recovery code. This code
is traditionally difficult to test in the lab, and, in the field, it rarely gets to run; yet, when it does …

Cited by 79 - Related articles - All 9 versions

[PDF] lujie.ac.cn

CloudRaid: Detecting distributed concurrency bugs via log mining and enhancement

J Lu, F Li, C Liu, L Li, X Feng… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org

Cloud systems suffer from distributed concurrency bugs, which often lead to data loss and
service outage. This paper presents CloudRaid, a new automatical tool for finding …

Cited by 10 - Related articles - All 3 versions

[PDF] uni.lu

Model-based testing of global properties on large-scale distributed systems

G Sunyé, EC De Almeida, Y Le Traon, B Baudry… - Information and …, 2014 - Elsevier

Context Large-scale distributed systems are becoming commonplace with the large
popularity of peer-to-peer and cloud computing. The increasing importance of these systems …

Cited by 17 - Related articles - All 9 versions

Hero: Heterogeneity-aware erasure coded redundancy optimal allocation for reliable storage in distributed networks

G Xu, S Lin, G Wang, X Liu, K Shi… - 2012 IEEE 31st …, 2012 - ieeexplore.ieee.org

Heterogeneity is the natural feature in distributed networks. Different from the traditional disk
array, the amount of data allocated on heterogenous peers may be not the same. To …

Cited by 12 - Related articles - All 3 versions

[PDF] epfl.ch

Automated vulnerability discovery in distributed systems

R Banabic, G Candea… - 2011 IEEE/IFIP 41st …, 2011 - ieeexplore.ieee.org

In this paper we present a technique for automatically assessing the amount of damage a
small number of participant nodes can inflict on the overall performance of a large …

Cited by 11 - Related articles - All 14 versions

[PDF] westernsydney.edu.au

Efficient data reliability management of cloud storage systems for big data applications

R Nachiappan - 2020 - researchdirect.westernsydney.edu …

Cloud service providers are consistently striving to provide efficient and reliable service, to
their client's Big Data storage need. Replication is a simple and flexible method to ensure …

Thermo-Mechanical Coupling Induced Performance Degradation in Storage Systems

S Sondur, K Gross, K Kant - 2020 20th IEEE/ACM International …, 2020 - ieeexplore.ieee.org

This paper explores the coupling between power/thermal aspects of data center and the
vibrations caused by chassis/server fans in environments dominated by mechanical disks …

[CITATION][C] Uma abordagem para o teste de dependabilidade de sistemas MapReduce com base em casos de falha representativos

JE Marynowski

[CITATION][C] Model-Based Testing of Large-Scale Distributed Routing Tables

G Sunyé, B Baudry, JM Jézéquel, Y Le Traon
