Reproducible, scalable, and shareable analysis pipelines with bioinformatics workflow managers

L Wratten, A Wilm, J Göke - Nature methods, 2021 - nature.com
The rapid growth of high-throughput technologies has transformed biomedical research.
With the increasing amount and complexity of data, scalability and reproducibility have …

A survey on checkpointing strategies: Should we always checkpoint à la Young/Daly?

L Bautista-Gomez, A Benoit, S Di, T Herault… - Future Generation …, 2024 - Elsevier
Abstract The Young/Daly formula provides an approximation of the optimal checkpointing
period for a parallel application executing on a supercomputing platform. It was originally …

Reliability-aware cost-efficient scientific workflows scheduling strategy on multi-cloud systems

X Tang - IEEE Transactions on Cloud Computing, 2021 - ieeexplore.ieee.org
Nowadays, more and more computation-intensive scientific applications with diverse needs
are migrating to cloud computing systems. However, the cloud systems alone cannot meet …

An improved list-based task scheduling algorithm for fog computing environment

R Madhura, BL Elizabeth, VR Uthariaraj - Computing, 2021 - Springer
A high-performance execution of programs predominately depends on the efficient
scheduling of tasks. An application consists of a sequence of tasks that can be represented …

Reliability-aware and energy-efficient workflow scheduling in iaas clouds

L Ye, Y Xia, S Tao, C Yan, R Gao… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Nowadays, more and more workflow applications with different computing requirements are
migrated to clouds and executed with cloud resources. Workflow scheduling becomes a …

Reliable budget aware workflow scheduling strategy on multi-cloud environment

KK Chakravarthi, P Neelakantan, L Shyamala… - Cluster …, 2022 - Springer
The resource provisioning and workflow execution in a multi-cloud environment using a pay-
as-you-use framework have recently gained the attention of the cloud computing research …

On the design of reactive approach with flexible checkpoint interval to tolerate faults in cloud computing systems

M Amoon, N El-Bahnasawy, S Sadi… - Journal of Ambient …, 2019 - Springer
The likelihood of failures rises in cloud computing systems as a result of their unstable
nature. Additionally, the size of a cloud computing system varies with time and thus failures …

A generic approach to scheduling and checkpointing workflows

L Han, V Le Fèvre, LC Canon, Y Robert… - Proceedings of the 47th …, 2018 - dl.acm.org
This work deals with scheduling and checkpointing strategies to execute scientific workflows
on failure-prone large-scale platforms. To the best of our knowledge, this work is the first to …

Resource allocation and aging priority-based scheduling of linear workflow applications with transient failures and selective imprecise computations

HD Karatza, GL Stavrinides - Cluster Computing, 2024 - Springer
A wide range of applications in distributed environments have a linear structure, varying
priorities, and may experience transient software failures. As the computational demands of …

Checkpointing strategies to tolerate non-memoryless failures on HPC platforms

A Benoit, L Perotin, Y Robert, F Vivien - ACM Transactions on Parallel …, 2024 - dl.acm.org
This article studies checkpointing strategies for parallel applications subject to failures. The
optimal strategy to minimize total execution time, or makespan, is well known when failure …