Total order broadcast and multicast algorithms: Taxonomy and survey

X Défago, A Schiper, P Urbán - ACM Computing Surveys (CSUR), 2004 - dl.acm.org
Total order broadcast and multicast (also called atomic broadcast/multicast) present an
important problem in distributed systems, especially with respect to fault-tolerance. In short …

Scalable state-machine replication

CE Bezerra, F Pedone… - 2014 44th Annual IEEE …, 2014 - ieeexplore.ieee.org
State machine replication (SMR) is a well-known technique able to provide fault-tolerance.
SMR consists of sequencing client requests and executing them against replicas in the …

Advanced pattern recognition for detection of complex software aging phenomena in online transaction processing servers

KJ Cassidy, KC Gross… - … on dependable systems …, 2002 - ieeexplore.ieee.org
Software aging phenomena have been recently studied; one particularly complex type is
shared memory pool latch contention in large OLTP servers. Latch contention onset leads to …

Optimistic parallel state-machine replication

PJ Marandi, F Pedone - 2014 IEEE 33rd International …, 2014 - ieeexplore.ieee.org
State-machine replication, a fundamental approach to fault tolerance, requires replicas to
execute commands deterministically, which usually results in sequential execution of …

Achieving low tail-latency and high scalability for serializable transactions in edge computing

X Chen, H Song, J Jiang, C Ruan, C Li… - Proceedings of the …, 2021 - dl.acm.org
A distributed database utilizing the wide-spread edge computing servers to provide low-
latency data access with the serializability guarantee is highly desirable for emerging edge …

Partial replication in the database state machine

A Sousa, F Pedone, R Oliveira… - … Symposium on Network …, 2001 - ieeexplore.ieee.org
This paper investigates the use of partial replication in the Database State Machine
approach introduced earlier for fully replicated databases. It builds on the order and …

Scalable service-oriented replication with flexible consistency guarantee in the cloud

T Chen, R Bahsoon, ARH Tawil - Information Sciences, 2014 - Elsevier
Replication techniques are widely applied in and for cloud to improve scalability and
availability. In such context, the well-understood problem is how to guarantee consistency …

Geo-replicated storage with scalable deferred update replication

D Sciascia, F Pedone - 2013 43rd Annual IEEE/IFIP …, 2013 - ieeexplore.ieee.org
Many current online services are deployed over geographically distributed sites (ie,
datacenters). Such distributed services call for geo-replicated storage, that is, storage …

On the inherent cost of atomic broadcast and multicast in wide area networks

N Schiper, F Pedone - International Conference on Distributed Computing …, 2008 - Springer
In this paper, we study the atomic broadcast and multicast problems, two fundamental
abstractions for building fault-tolerant systems. As opposed to atomic broadcast, atomic …

EpTO: An epidemic total order algorithm for large-scale distributed systems

M Matos, H Mercier, P Felber, R Oliveira… - Proceedings of the 16th …, 2015 - dl.acm.org
The ordering of events is a fundamental problem of distributed computing and has been
extensively studied over several decades. From all the available orderings, total ordering is …