Analytical model on hybrid state saving with a limited number of checkpoints and bound rollbacks

M Ohara, R Suzuki, M Arai, S Fukumoto, K Iwasaki
IEICE transactions on fundamentals of electronics, communications and …, 2006 - search.ieice.org
This paper discusses distributed checkpointing with logging for practical applications running with limited resources. We present a discrete-time model that evaluates the total expected overhead per event when the number of checkpoints that each process can hold is finite. In many actual applications, the rollback distance is also bounded by some finite interval. The recovery overhead of the checkpointing scheme is therefore described using a truncated geometric distribution as the rollback distance distribution. Although it is difficult to derive analytically the optimal checkpoint interval, which minimizes the total expected overhead, substituting simpler probability distributions for the truncated geometric distribution allows it to be derived explicitly. Numerical examples obtained through simulations show that the new models and analyses achieve a nearly minimal total overhead.
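To illustrate what a truncated geometric rollback distance looks like, the following is a minimal Python sketch, not the paper's model. It assumes a rollback distance D taking values 1 through max_distance with P(D = d) proportional to p(1 - p)^(d - 1); the support starting at 1, the parameter names p and max_distance, and both function names are hypothetical choices made here, since the abstract does not give the paper's parameterization.

```python
def truncated_geometric_pmf(p, max_distance):
    """Return [P(D = 1), ..., P(D = max_distance)] for a geometric
    distribution truncated at max_distance and renormalized.

    Assumption (not from the paper): support starts at d = 1.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    if max_distance < 1:
        raise ValueError("max_distance must be at least 1")
    # Unnormalized geometric weights p * (1 - p)^(d - 1).
    weights = [p * (1.0 - p) ** (d - 1) for d in range(1, max_distance + 1)]
    # The weights sum to 1 - (1 - p)^max_distance; dividing renormalizes.
    total = sum(weights)
    return [w / total for w in weights]


def expected_rollback_distance(p, max_distance):
    """Mean of the truncated geometric rollback distance."""
    pmf = truncated_geometric_pmf(p, max_distance)
    return sum(d * q for d, q in enumerate(pmf, start=1))
```

Under these assumptions, the truncation keeps the mean rollback distance below the untruncated geometric mean 1/p, which is one way to see why bounding rollbacks limits the recovery overhead.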
example.edu/paper.pdf