作者
Qin Zheng, Bharadwaj Veeravalli, Chen-Khong Tham
发表日期
2008/9/19
期刊
IEEE Transactions on Computers
卷号
58
期号
3
页码范围
380-393
出版商
IEEE
简介
Fault-tolerant scheduling is an imperative step for large-scale computational grid systems, as often geographically distributed nodes co-operate to execute a task. By and large, primary-backup approach is a common methodology used for fault tolerance wherein each task has a primary copy and a backup copy on two different processors. In this paper, we identify two cases that may happen when scheduling dependent tasks with primary-backup approach. We derive two important constraints that must be satisfied. Further, we show that these two constraints play a crucial role in limiting the schedulability and overloading efficiency of backups of dependent tasks. We then propose two strategies to improve schedulability and overloading efficiency, respectively. We propose two algorithms (MRC-ECT and MCT-LRC), to schedule backups of independent jobs and dependent jobs, respectively. MRC-ECT is shown to …
引用总数
2008200920102011201220132014201520162017201820192020202120222023202412559118111096111111553