IC Garcia, G Vieira, LE Buzato - arXiv preprint arXiv:1702.06167, 2017 - arxiv.org
The literature on communication-induced checkpointing presents a family of protocols that use logical clocks to control whether forced checkpoints must be taken. Efficiency of these …
JM Helary, A Mostefaoui, RHB Netzer… - Proceedings of SRDS' …, 1997 - ieeexplore.ieee.org
A useless checkpoint is a local checkpoint that cannot be part of a consistent global checkpoint. The paper addresses the following important problem. Given a set of processes …
Stream processing in the last decade has seen broad adoption in both commercial and research settings. One key element for this success is the ability of modern stream …
M Raynal - The French School of Programming, 2023 - Springer
Similarly to the injunction “Know yourself” engraved on the frontispiece of Delphi's temple more than two millennia ago, the sentence “Make it as simple as possible, but not …
Y Luo, D Manivannan - Journal of Parallel and Distributed Computing, 2009 - Elsevier
Communication-Induced Checkpointing (CIC) protocols are classified into two categories in the literature: Index-based and Model-based. In this paper, we discuss two data structures …
S Bouchenak, D Hagimont, S Krakowiak… - Software: Practice …, 2004 - Wiley Online Library
Today, mobility and persistence are important aspects of distributed computing. They have many fields of use such as load balancing, fault tolerance and dynamic reconfiguration of …
Existing algorithms for global snapshots in distributed systems are not scalable when the underlying topology is complete. There are primarily two classes of existing algorithms for …
Y Fan, C Farn - … Conference on System Sciences (HICSS'07), 2007 - ieeexplore.ieee.org
In order to reduce the number of vehicles stuck in congestion, especially for stop-and-go traffic at toll plazas, the establishment of an electronic toll collection (ETC) system has been a …
Y Luo, D Manivannan - Future Generation Computer Systems, 2012 - Elsevier
Future generation supercomputers will be message-passing distributed systems consisting of hundreds of thousands of processors. As the size of the system grows, failure rate …