Evaluating the performance of HPC-style SYCL applications

T Deakin, S McIntosh-Smith - … of the International Workshop on OpenCL, 2020 - dl.acm.org
SYCL is a parallel programming model for developing single-source programs for running
on heterogeneous platforms. To this end, it allows for one code to be written which can run …

Porting a legacy cuda stencil code to oneapi

S Christgau, T Steinke - 2020 IEEE International Parallel and …, 2020 - ieeexplore.ieee.org
Recently, Intel released the oneAPI programming environment. With Data Parallel C++
(DPC++), oneAPI enables codes to target multiple hardware architectures like multi-core …

Enhancing Kokkos with OpenACC

P Valero-Lara, S Lee… - … Journal of High …, 2024 - journals.sagepub.com
C++ template metaprogramming has emerged as a prominent approach for achieving
performance portability in heterogeneous computing. Kokkos represents a notable paradigm …

Performance Study of GPU applications using SYCL and CUDA on Tesla V100 GPU

GKR Kuncham, R Vaidya… - 2021 IEEE High …, 2021 - ieeexplore.ieee.org
SYCL standard enables single-source programs to run on heterogeneous platforms
consisting of CPUs, GPUs, FPGAs across different hardware vendors. SYCL combines …

Performance portability study of epistasis detection using sycl on nvidia gpu

Z Jin, JS Vetter - Proceedings of the 13th ACM International Conference …, 2022 - dl.acm.org
We describe the experience of converting a CUDA implementation of a high-order epistasis
detection algorithm to SYCL. The goals are for our work to be useful to application and …

A Performance-Portable SYCL Implementation of CRK-HACC for Exascale

EM Rangel, SJ Pennycook, A Pope… - Proceedings of the SC' …, 2023 - dl.acm.org
The first generation of exascale systems will include a variety of machine architectures,
featuring GPUs from multiple vendors. As a result, many developers are interested in …

Case study of using Kokkos and SYCL as performance-portable frameworks for Milc-Dslash benchmark on NVIDIA, AMD and Intel GPUs

AS Dufek, R Gayatri, N Mehta… - … and Productivity in …, 2021 - ieeexplore.ieee.org
Six of the top ten supercomputers in the TOP500 list from June 2021 rely on NVIDIA GPUs to
achieve their peak compute bandwidth. With the announcement of Aurora, Frontier, and El …

KokkACC: enhancing Kokkos with openACC

P Valero-Lara, S Lee… - 2022 Workshop on …, 2022 - ieeexplore.ieee.org
Template metaprogramming is gaining popularity as a high-level solution for achieving
performance portability on heterogeneous computing resources. Kokkos is a representative …

Outcomes of openMP hackathon: openMP application experiences with the offloading model (part II)

B Chapman, B Pham, C Yang, C Daley… - … Enabling Massive Node …, 2021 - Springer
This paper reports on experiences gained and practices adopted when using the latest
features of OpenMP to port a variety of HPC applications and mini-apps based on different …

Evaluating cuda portability with hipcl and dpct

Z Jin, J Vetter - 2021 IEEE International Parallel and Distributed …, 2021 - ieeexplore.ieee.org
HIPCL is expanding the scope of the CUDA portability route from an AMD platform to an
OpenCL platform. In the meantime, the Intel DPC++ Compatibility Tool (DPCT) is migrating a …