Assessing opportunities of SYCL for biological sequence alignment on GPU-based systems

M Costanzo, E Rucci, C García-Sanchez… - The Journal of …, 2024 - Springer
Bioinformatics and computational biology are two fields that have been exploiting GPUs for
more than two decades, with being CUDA the most used programming language for them …

A Performance-Portable SYCL Implementation of CRK-HACC for Exascale

EM Rangel, SJ Pennycook, A Pope… - Proceedings of the SC' …, 2023 - dl.acm.org
The first generation of exascale systems will include a variety of machine architectures,
featuring GPUs from multiple vendors. As a result, many developers are interested in …

[HTML][HTML] Enabling performance portability on the LiGen drug discovery pipeline

L Crisci, L Carpentieri, B Cosenza, G Accordi… - Future Generation …, 2024 - Elsevier
In recent years, there has been a growing interest in developing high-performance
implementations of drug discovery processing software. To target modern GPU …

Experiences building an mlir-based sycl compiler

E Tiotto, V Pérez, W Tsang, L Sommer… - 2024 IEEE/ACM …, 2024 - ieeexplore.ieee.org
Similar to other programming models, compilers for SYCL, the open programming model for
heterogeneous computing based on C++, would benefit from access to higher-level …

Analyzing the Performance Portability of SYCL across CPUs, GPUs, and Hybrid Systems with Protein Database Search

M Costanzo, E Rucci, C García-Sánchez… - arXiv preprint arXiv …, 2024 - arxiv.org
The high-performance computing (HPC) landscape is undergoing rapid transformation, with
an increasing emphasis on energy-efficient and heterogeneous computing environments …

Assessing Opportunities of SYCL and Intel oneAPI for Biological Sequence Alignment

M Costanzo, E Rucci, CG Sánchez, M Naiouf… - arXiv preprint arXiv …, 2022 - arxiv.org
Bioinformatics and Computational Biology are two fields that have been exploiting GPUs for
more than two decades, with being CUDA the most used programming language for them …

Open SYCL on heterogeneous GPU systems: A case of study

R Carratalá-Sáez, Y Torres… - arXiv preprint arXiv …, 2023 - arxiv.org
Computational platforms for high-performance scientific applications are becoming more
heterogenous, including hardware accelerators such as multiple GPUs. Applications in a …

Extending the SYCL Joint Matrix for Binarized Neural Networks

Z Jin - 2024 IEEE International Parallel and Distributed …, 2024 - ieeexplore.ieee.org
In contrast to the warp matrix-multiplication application-programming interface (WMMA) for
tensor hardware programming in Compute Unified Device Architecture (CUDA), the SYCL …

Comparing Performance and Portability Between CUDA and SYCL for Protein Database Search on NVIDIA, AMD, and Intel GPUs

M Costanzo, E Rucci, C García-Sánchez… - 2023 IEEE 35th …, 2023 - ieeexplore.ieee.org
The heterogeneous computing paradigm has led to the need for portable and efficient
programming solutions that can leverage the capabilities of various hardware devices, such …

New generation of GPGPU and related hardware: computing systems microarchitecture and performance from servers to supercomputers

MB Kuzminsky - Программные системы: теория и приложения, 2024 - mathnet.ru
An overview of the current state of GPGPUs is given, with orientation towards their using to
traditional HPC tasks (and less to AI). The basic GPGPUs in the review include Nvidia V100 …