Task-Level Checkpointing and Localized Recovery to Tolerate Permanent Node Failures for Nested Fork–Join Programs in Clusters

L Reitz, C Fohry - SN Computer Science, 2024 - Springer
Exascale supercomputers consist of millions of processing units, and this number is still
growing. Therefore, hardware failures, such as permanent node failures, become …

Optimizing Cloud Service Load Balancing Through Heat Conduction Equation Applications

H He, L Wang, J Liu, L Qin - International Journal of Heat & …, 2024 - search.ebscohost.com
With the rapid development of cloud computing, load balancing technology in cloud services
has become a critical component in ensuring service quality and system stability. Traditional …

Distributed Asynchronous Contact Mechanics with DARMA/vt

N Morales, R Jones, J Lifflander, PP Pébaÿ… - … on Asynchronous Many …, 2024 - Springer
Contact mechanics, or the modeling of the impenetrability of solid objects, is fundamental to
computational solid mechanics (CSM) applications yet is oftentimes the most challenging in …

Vector load balancing for high-performance parallel applications

R Buch - 2023 - ideals.illinois.edu
Load balancing, modifying the distribution of work across a system to optimize resource
usage, is vital for achieving scalability and high performance for dynamic parallel …

Distributed Asynchronous Contact Mechanics with DARMA/vt

S McGovern, C Skrzyński, C Schilly - … , TN, USA, February 14–16, 2024 … - books.google.com
Contact mechanics, or the modeling of the impenetrability of solid objects, is fundamental to
computational solid mechanics (CSM) applications yet is oftentimes the most challenging in …