Lessons learned on MPI+threads communication

R Zambre… - … Conference for High …, 2022 - ieeexplore.ieee.org
Hybrid MPI+threads programming is gaining prominence, but, in practice, applications
perform slower with it compared to the MPI everywhere model. The most critical challenge to …

Parallel isotope differential modeling for instationary 13C fluxomics at the genome scale

Z Zhang, Z Liu, Y Meng, Z Chen, J Han, Y Wei… - Biotechnology for …, 2020 - Springer
Background A precise map of the metabolic fluxome, the closest surrogate to the
physiological phenotype, is becoming progressively more important in the metabolic …

Logically parallel communication for fast MPI+threads applications

R Zambre, D Sahasrabudhe, H Zhou… - … on Parallel and …, 2021 - ieeexplore.ieee.org
Supercomputing applications are increasingly adopting the MPI+threads programming
model over the traditional “MPI everywhere” approach to better handle the disproportionate …

Vectorized barrier and reduction in LLVM OpenMP runtime

MN Farooqi, M Pericàs - International Workshop on OpenMP, 2021 - Springer
Barrier synchronization is a well known operation in parallel processing that can be an
obstacle for getting performance in parallel programs, particularly for high thread counts …

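A hierarchical barrier synchronization mechanism for many-core systems

Z Zang, C Li, Y Wang, X Chen, Y Guo - Computer Engineering & Science, 2022 - joces.nudt.edu.cn
Synchronization operations play an important role in ensuring the data consistency and
correctness of multicore processor threads. As the number of processor cores keeps growing, the overhead of synchronization operations also grows. Barrier synchronization is, in parallel applications, a multicore synchronization …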

Non-blocking technique for parallel algorithms with global barrier synchronization

A Garza, CA Parra, ID Scherson - … International Conference on …, 2021 - ieeexplore.ieee.org
Sharing data among asynchronous processes is considered to be a hard systems problem
in multithreaded modern shared-memory multicore systems. Throughout the literature …

Performance-robust, Non-blocking, Data-driven Barrier Synchronization for Multicore, Multithreaded Parallel Algorithms

A Garza Rodriguez - 2023 - escholarship.org
In a general-purpose multicore parallel multithreaded environment, multiple threads work
simultaneously to finish a task faster. Usually, threads need to communicate with each other …

A hierarchical hardware barrier synchronization design for many-core processors

Z Zang, C Li, Y Wang, X Chen… - Computer Engineering & …, 2022 - joces.nudt.edu.cn
Synchronization plays an important role in ensuring data consistency and correctness of
multicore processor threads. As the number of processor cores increases, the cost of …

[BOOK][B] Optimizing Query Processing Under Skew

W Zhang - 2020 - search.proquest.com
Big data systems such as relational databases, data science platforms, and scientific
workflows all process queries over large and complex datasets. Skew is common in these …

[BOOK][B] Exascalable Communication for Modern Supercomputing

R Zambre - 2020 - search.proquest.com
Supercomputing applications rely on strong scaling to achieve faster results on a larger
number of processing units. But, at the strong-scaling limit, where communication is a …