The implementation of Dynamite: An environment for migrating PVM tasks

V Hamscher, U Schwiegelshohn, A Streit… - Grid Computing—GRID …, 2000 - Springer

In this paper, we discuss typical scheduling structures that occur in computational grids.
Scheduling algorithms and selection strategies applicable to these structures are introduced …

被引用次数：465 相关文章所有 17 个版本

[PDF] acm.org

The distributed ASCI supercomputer project

H Bal, R Bhoedjang, R Hofman, C Jacobs… - ACM SIGOPS …, 2000 - dl.acm.org

The Distributed ASCI Supercomputer (DAS) is a homogeneous wide-area distributed system
consisting of four cluster computers at different locations. DAS has been used for research …

被引用次数：146 相关文章所有 17 个版本

Dynamic load balancing in parallel execution of cellular automata

A Giordano, A De Rango, R Rongo… - … on Parallel and …, 2020 - ieeexplore.ieee.org

The allocation of the computational load across different processing elements is an
important issue in parallel computing. Indeed, an unbalanced load distribution can strongly …

被引用次数：21 相关文章所有 3 个版本

[PDF] vu.nl

Supporting internet-scale multi-agent systems

NJE Wijngaards, BJ Overeinder, M van Steen… - Data & Knowledge …, 2002 - Elsevier

The Internet provides a large-scale environment for (intelligent) software agents. Agents are
autonomous (mobile) processes, capable of communication with other agents, interaction …

被引用次数：128 相关文章所有 22 个版本

[PDF] mit.edu

[PDF][PDF] Transparent user-level checkpointing for the native posix thread library for linux.

M Rieker, J Ansel, G Cooperman - PDPTA, 2006 - people.csail.mit.edu

Checkpointing of single-threaded applications has been long studied [3],[6],[8],[12],[15].
Much less research has been done for user-level checkpointing of multithreaded …

被引用次数：73 相关文章所有 10 个版本

A fault-tolerant hybrid resource allocation model for dynamic computational grid

S Sheikh, A Nagaraju, M Shahid - Journal of Computational Science, 2021 - Elsevier

Effectual allocation of resources with fault tolerance is one of the key targets in any
computational grid environment to accomplish the task execution on time. In this paper, a …

被引用次数：14 相关文章

[PDF] uniroma2.it

Current practice and a direction forward in checkpoint/restart implementations for fault tolerance

JC Sancho, F Petrini, K Davis… - 19th IEEE …, 2005 - ieeexplore.ieee.org

Checkpoint/restart is a general idea for which particular implementations enable various
functionalities in computer systems, including process migration, gang scheduling …

被引用次数：78 相关文章所有 9 个版本

[HTML] proquest.com

[图书][B] Coordinated checkpoint/restart process fault tolerance for MPI applications on HPC systems

J Hursey - 2010 - search.proquest.com

Scientists use advanced computing techniques to assist in answering the complex questions
at the forefront of discovery. The High Performance Computing (HPC) scientific applications …

被引用次数：45 相关文章所有 6 个版本

[PDF] researchgate.net

User-level process checkpoint and restore for migration

M Bozyigit, M Wasiq - ACM SIGOPS Operating Systems Review, 2001 - dl.acm.org

In simple words, process checkpointing means saving the state of a process, so that, it can
be reconstructed in the future. Checkpointing followed by restore is important for the purpose …

被引用次数：47 相关文章所有 5 个版本

[PDF] vu.nl

Agent factory: Generative migration of mobile agents in heterogeneous environments

FMT Brazier, BJ Overeinder, M van Steen… - Proceedings of the …, 2002 - dl.acm.org

In most of today's agent systems migration of agents requires homogeneity in the
programming language and/or agent platform in which an agent has been designed. In this …

被引用次数：73 相关文章所有 13 个版本

高级搜索

QQ 群