COMPI: Concolic testing for MPI applications

H Li, S Li, Z Benavides, Z Chen… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
MPI is widely used as the bedrock of HPC applications, but there are no effective systematic
software testing techniques for MPI programs. In this paper we develop COMPI, the first …

Jdebug: a fast, non-intrusive and scalable fault locating tool for ten-million-scale parallel applications

D Peng, Y Feng, Y Liu, X Liu, W Xue… - … on Parallel and …, 2022 - ieeexplore.ieee.org
This article presents Jdebug, a fast, non-intrusive and scalable fault locating tool for extreme-
scale parallel applications. Large-scale debugging has drawn more attention with the …

MPIRace: A Static Data Race Detector for MPI Programs

W Wang - International Workshop on Languages and Compilers …, 2022 - Springer
Data races in distributed parallel programs, such as those developed with the message
passing interface (MPI), can cause critical correctness and reliability issues. Therefore, it is …

Protecting Synchronization Mechanisms of Parallel Big Data Kernels via Logging

T LeCompte, L Peng, X Yuan… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
With the growing effort to reduce power consumption in machines, fault tolerance becomes
more of a concern. This holds particularly for large-scale computing, where execution …

A Hybrid Approach for Detecting Bugs in HPC Workloads

S Sherpa, X Zhao - 2023 IEEE 29th International Conference on …, 2023 - ieeexplore.ieee.org
MPI programs represent a major type of workloads running on parallel and distributed
systems: tightly coupled high performance computing (HPC) workloads which use MPI to …

ELS: Emulation system for debugging and tuning large-scale parallel programs on small clusters

F Lin, Y Liu, Y Guo, D Qian - The Journal of Supercomputing, 2021 - Springer
Continuous scaling-up of high-performance computing systems has brought challenges to
the debugging and tuning of large-scale parallel programs. Firstly, to locate bugs in a …

Non-intrusively avoiding scaling problems in and out of MPI collectives

H Li, Z Chen, R Gupta, M Xie - 2018 IEEE International Parallel …, 2018 - ieeexplore.ieee.org
It has been observed that scaling problems are highly likely to manifest when MPI
applications are launched at a large scale where the scale is characterized by the degree of …

Detection of bugs in MPI programs using a hybrid approach

S Sherpa - 2020 - rex.libraries.wsu.edu
First I'd like to express my sincere gratitude to my advisor Dr. Xinghui Zhao, for her guidance,
patience, motivation and immense knowledge. Her guidance was instrumental to making my …

Pre- and Post-Deployment Dynamic Bug Detection Techniques for MPI Programs

H Li - 2018 - search.proquest.com
MPI is the de-facto standard message-passing based parallel programming model.
However, the bug detection support for MPI applications is lacking. This thesis seeks to …