MobileRE: A replicas prioritized hybrid fault tolerance strategy for mobile distributed system

Y Wu, D Liu, X Chen, J Ren, R Liu, Y Tan… - Journal of Systems …, 2021 - Elsevier
Fault tolerance techniques are of vital importance to promise data reliability for mobile
distributed system. In mobile environments, nodes suffer from high failure probability and …

Dynamic multiple node failure recovery in distributed storage systems

M Itani, S Sharafeddine, I ElKabani - Ad Hoc Networks, 2018 - Elsevier
Our daily lives are getting more and more dependent on data centers and distributed
storage systems in general, whether at the business or at the personal level. With the advent …

Enabling concurrent failure recovery for regenerating-coding-based storage systems: From theory to practice

R Li, J Lin, PPC Lee - IEEE Transactions on Computers, 2014 - ieeexplore.ieee.org
Data availability is critical in distributed storage systems, especially when node failures are
prevalent in real life. A key requirement is to minimize the amount of data transferred among …

A cost-efficient hybrid redundancy coding scheme for wireless storage systems

A Zhou, N Zhou, B Yi, C Zhu - Computer Communications, 2023 - Elsevier
With distributed storage technique becoming a promising technology for storing massive
data in wireless environment, how to improve the reliability of the storage systems has …

HRSPC: a hybrid redundancy scheme via exploring computational locality to support fast recovery and high reliability in distributed storage systems

S Li, Q Cao, S Wan, L Qian, C Xie - Journal of Network and Computer …, 2016 - Elsevier
Replication and erasure codes are two popular schemes to provide fault tolerance in
distributed storage systems. However, they both face some challenges when used in cloud …

Robust redundancy scheme for the repair process: hierarchical codes in the bandwidth-limited systems

Z Huang, Y Lin, Y Peng - Journal of Grid Computing, 2012 - Springer
High performance computing can be well supported by the Grid or cloud computing systems.
However, these systems have to overcome the failure risks, where data is stored in the …

On the impact of erasure coding parameters to the reliability of distributed brick storage systems

X Luo, Y Wang, Z Shen - 2009 International Conference on …, 2009 - ieeexplore.ieee.org
For a given amount of storage overhead, erasure coding offers a higher degree of
survivability than pure replication. Consequently, erasure coding attracts much attention …

Heterogeneity-aware codes with uncoded repair for distributed storage systems

B Zhu, KW Shum, H Li - IEEE Communications Letters, 2015 - ieeexplore.ieee.org
In practical large-scale distributed storage systems, node failures are unavoidable. It is
therefore desirable to quickly recreate the failed nodes in order to maintain the system …

REDU: reducing redundancy and duplication for multi-failure recovery in erasure-coded storages

J Zhang, S Li, X Liao - The Journal of Supercomputing, 2016 - Springer
Data reliability is a significant issue in large-scale storage systems. Erasure codes provide
high data reliability via data recovery, which however generates a large amount of data …

Practical multiple node failure recovery in distributed storage systems

M Itani, S Sharafeddine… - 2016 IEEE Symposium on …, 2016 - ieeexplore.ieee.org
As multiple node failures are becoming so frequent in distributed storage systems, many
erasure coding techniques are emerging to handle such failures. In this paper we use the …