The landscape of gpu-centric communication

D Unat, I Turimbetov, MKT Issa, D Sağbili… - arXiv preprint arXiv …, 2024 - arxiv.org
In recent years, GPUs have become the preferred accelerators for HPC and ML applications
due to their parallelism and fast memory bandwidth. While GPUs boost computation, inter …

InfiniBand Verbs on GPU: a case study of controlling an InfiniBand network device from the GPU

L Oden, H Fröning - The International Journal of High …, 2017 - journals.sagepub.com
Due to their massive parallelism and high performance per Watt, GPUs have gained high
popularity in high-performance computing and are a strong candidate for future exascale …

GPU triggered networking for intra-kernel communications

M LeBeane, K Hamidouche, B Benton… - Proceedings of the …, 2017 - dl.acm.org
GPUs are widespread across clusters of compute nodes due to their attractive performance
for data parallel codes. However, communicating between GPUs across the cluster is …

Gpu initiated openshmem: correct and efficient intra-kernel networking for dgpus

K Hamidouche, M LeBeane - Proceedings of the 25th ACM SIGPLAN …, 2020 - dl.acm.org
Current state-of-the-art in GPU networking utilizes a host-centric, kernel-boundary
communication model that reduces performance and increases code complexity. To address …

Design and Implementation of MPI-Native GPU-Initiated MPI Partitioned Communication

YH Temuçin, W Schonbein, S Levy… - SC24-W: Workshops …, 2024 - ieeexplore.ieee.org
Graphics Processing Units have become the dominant type of accelerators for high-
performance computing and artificial intelligence. To support these systems, new …

Analyzing communication models for distributed thread-collaborative processors in terms of energy and time

B Klenk, L Oden, H Froning - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
Accelerated computing has become pervasive for increasing the computational power and
energy efficiency in terms of GFLOPs/Watt. For application areas with highest demands, for …

ComP-net: Command processor networking for efficient intra-kernel communications on GPUs

M LeBeane, K Hamidouche, B Benton… - Proceedings of the 27th …, 2018 - dl.acm.org
Current state-of-the-art in GPU networking advocates a host-centric model that reduces
performance and increases code complexity. Recently, researchers have explored several …

[PDF][PDF] GPU-Centric Communication Schemes: When CPUs Take a Back Seat

I Ismayilov - 2023 - parcorelab.ku.edu.tr
In recent years, GPUs have become the leading accelerator in modern high-performance
systems such that much of HPC computational capability has concentrated in clusters of …

Energy-efficient stencil computations on distributed gpus using dynamic parallelism and gpu-controlled communication

L Oden, B Klenk, H Fröning - 2014 Energy Efficient …, 2014 - ieeexplore.ieee.org
GPUs are widely used in high performance computing, due to their high computational
power and high performance per Watt. Still, one of the main bottlenecks of GPU-accelerated …

Hardware accelerated data processing operations for storage data

JR Feehrer, M Shih, M Cohen, K Chan… - US Patent …, 2021 - Google Patents
(57) ABSTRACT A method and system for processing data are disclosed. A processor, in
response to executing a software program, may write an entry in a work queue. The entry …