CHAMELEON: reactive load balancing for hybrid MPI+OpenMP task-parallel applications

J Klinkenberg, P Samfass, M Bader, C Terboven… - Journal of Parallel and …, 2020 - Elsevier
Many applications in high performance computing are designed based on underlying
performance and execution models. While these models could successfully be employed in …

From reactive to proactive load balancing for task‐based parallel applications in distributed memory machines

M Thanh Chung, J Weidendorfer… - Concurrency and …, 2023 - Wiley Online Library
Load balancing is often a challenge in task‐parallel applications. The balancing problems
are divided into static and dynamic. “Static” means that we have some prior knowledge about …

Reactive task migration for hybrid MPI+OpenMP applications

J Klinkenberg, P Samfass, M Bader, C Terboven… - … Conference on Parallel …, 2019 - Springer
Many applications in high performance computing are designed based on underlying
performance and execution models. While these models could successfully be employed in …

The tiny-tasks granularity trade-off: Balancing overhead versus performance in parallel systems

S Bora, B Walker, M Fidler - IEEE Transactions on Parallel and …, 2023 - ieeexplore.ieee.org
Models of parallel processing systems typically assume that one has workers and jobs are
split into an equal number of tasks. Splitting jobs into smaller tasks, i.e., using “tiny tasks”, can …

Predictive, reactive and replication-based load balancing of tasks in Chameleon and sam(oa)²

P Samfass, J Klinkenberg, MT Chung… - Proceedings of the …, 2021 - dl.acm.org
Increasingly complex hardware architectures as well as numerical algorithms make
balancing load in parallel numerical software for adaptive mesh refinement an inherently …

teaMPI—replication-based resilience without the (performance) pain

P Samfass, T Weinzierl, B Hazelwood… - … Conference, ISC High …, 2020 - Springer
In an era where we cannot afford to checkpoint frequently, replication is a generic way
forward to construct numerical simulations that can continue to run even if hardware parts …

Sabo: Dynamic MPI+OpenMP resource balancer

CR Barbosa, P Lemarinier, G Papauré… - 2022 IEEE/ACM Fifth …, 2022 - ieeexplore.ieee.org
Scientific parallel applications often use MPI for inter-node communications and OpenMP for
intra-node orchestration. Parallel applications such as particle transport, seismic wave …

Lightweight task offloading exploiting MPI wait times for parallel adaptive mesh refinement

P Samfass, T Weinzierl, DE Charrier… - Concurrency and …, 2020 - Wiley Online Library
Balancing the workload of sophisticated simulations is inherently difficult, since we have to
balance both computational workload and memory footprint over meshes that can change …

Towards data-flow parallelization for adaptive mesh refinement applications

K Sala, A Rico, V Beltran - 2020 IEEE International Conference …, 2020 - ieeexplore.ieee.org
Adaptive Mesh Refinement (AMR) is a prevalent method used by distributed-memory
simulation applications to adapt the accuracy of their solutions depending on the turbulent …

Distributed Work Stealing in a Task-Based Dataflow Runtime

J John, J Milthorpe, P Strazdins - International Conference on Parallel …, 2022 - Springer
The task-based dataflow programming model has emerged as an alternative to the process-
centric programming model for extreme-scale applications. However, load balancing is still a …