W Xue, C Yang, H Fu, X Wang, Y Xu… - IEEE Transactions …, 2014 - ieeexplore.ieee.org
In this work an ultra-scalable algorithm is designed and optimized to accelerate a 3D compressible Euler atmospheric model on the CPU-MIC hybrid system of Tianhe-2. We first …
Data-level parallelism is frequently ignored or underutilized. Achieved through vector/SIMD capabilities, it can provide substantial performance improvements on top of widely used …
Data-level parallelism is frequently ignored or underutilized. Achieved through vector/SIMD capabilities, it can provide substantial performance improvements on top of widely used …
The multidimensional positive definite advection transport algorithm (MPDATA) belongs to the group of nonoscillatory forward‐in‐time algorithms and performs a sequence of stencil …
L Szustak, K Halbiniak, L Kuczynski… - … Journal of High …, 2018 - journals.sagepub.com
Modern heterogeneous computing platforms have become powerful HPC solutions, which could be applied to a wide range of real-life applications. In particular, the hybrid platforms …
L Szustak, P Bratek - The International Journal of High …, 2019 - journals.sagepub.com
In this work, we take up the challenge of performance portable programming of heterogeneous stencil computations across a wide range of modern shared-memory …
This paper meets the challenge of harnessing the heterogeneous communication architecture of ccNUMA multiprocessors for heterogeneous stencil computations, an …
L Szustak, R Wyrzykowski, O Jakl - … , September 4-8, 2017, Proceedings 14, 2017 - Springer
SMP/NUMA systems are powerful HPC platforms which could be applied for a wide range of real-life applications. These systems provide large capacity of shared memory, and allow …
The main goal of this paper is the suitability assessment of the OpenMP Accelerator Model (OMPAM) for porting a real-life scientific application to heterogeneous platforms containing a …