Fault tolerance in distributed systems using fused data structures

B Balasubramanian, VK Garg - IEEE transactions on parallel …, 2012 - ieeexplore.ieee.org
Replication is the prevalent solution to tolerate faults in large data structures hosted on
distributed servers. To tolerate f crash faults (dead/unresponsive data structures) among n …

Implementing fault-tolerant services using state machines: Beyond replication

VK Garg - International Symposium on Distributed Computing, 2010 - Springer
This paper describes a method to implement fault-tolerant services in distributed systems
based on the idea of fused state machines. The theory of fused state machines uses a …

Fused data structures for handling multiple faults in distributed systems

B Balasubramanian, VK Garg - 2011 31st International …, 2011 - ieeexplore.ieee.org
The paper describes a technique to correct crash faults in large data structures hosted on
distributed servers, based on the concept of fused backups. The prevalent solution to this …

Fault tolerance in distributed systems using fused state machines

B Balasubramanian, VK Garg - Distributed computing, 2014 - Springer
Replication is a standard technique for fault tolerance in distributed systems modeled as
deterministic finite state machines (DFSMs or machines). To correct ff crash or ⌊ f/2 ⌋⌊ f/2⌋ …

Fault tolerance in distributed systems using fused data structures with the help of LT codes

K Rajkumar, P Swaminathan - International Journal of …, 2016 - inderscienceonline.com
To tolerate the crash faults among many different data structures which requires replication
of every data structure, resulting in some number of additional or extra backups. It is to …

Fused state machines for fault tolerance in distributed systems

B Balasubramanian, VK Garg - … 2011, Toulouse, France, December 13-16 …, 2011 - Springer
Replication is a standard technique for fault-tolerance in distributed systems modeled as
deterministic finite state machines (DFSMs or machines). To correct f crash faults among n …

An Improved Multiple Faults Reassignment based Recovery in Cluster Computing

S Bansal, S Sharma - arXiv preprint arXiv:1102.2616, 2011 - arxiv.org
In case of multiple node failures performance becomes very low as compare to single node
failure. Failures of nodes in cluster computing can be tolerated by multiple fault tolerant …

[PDF][PDF] A Fusion-based Approach for Handling Multiple Faults in Distributed Systems

B Balasubramanian, VK Garg - 2010 - users.ece.utexas.edu
The paper describes a technique to correct faults in large data structures hosted on
distributed servers, based on the concept of fused backups. The prevalent solution to this …

[PDF][PDF] APPLICATION LEVEL CHECKPOINT-BASED APPROACH FOR CRUSH FAILURE IN DISTRIBUTED SYSTEM

MM Khaing, PP Tar - academia.edu
Fault-tolerance is an important and critical issue in distributed and parallel processing
system. Distributed system consists of a collection of interconnected stand-alone computers …

[PDF][PDF] Fusion-based DFSMs for Fault Tolerance in Distributed Systems

B Balasubramanian, VK Garg - users.ece.utexas.edu
Replication is a standard solution for fault-tolerance in distributed systems modeled as
deterministic finite state machines (DFSMs or machines). To correct f crash faults among n …