GPU virtualization and scheduling methods: A comprehensive survey

CH Hong, I Spence, DS Nikolopoulos - ACM Computing Surveys (CSUR …, 2017 - dl.acm.org
The integration of graphics processing units (GPUs) on high-end compute nodes has
established a new accelerator-based heterogeneous computing model, which now …

A complete and efficient CUDA-sharing solution for HPC clusters

AJ Peña, C Reaño, F Silla, R Mayo, ES Quintana-Ortí… - Parallel Computing, 2014 - Elsevier
In this paper we detail the key features, architectural design, and implementation of rCUDA,
an advanced framework to enable remote and transparent GPGPU acceleration in HPC …

Composable architecture for rack scale big data computing

CS Li, H Franke, C Parris, B Abali, M Kesavan… - Future Generation …, 2017 - Elsevier
The rapid growth of cloud computing, both in terms of the spectrum and volume of cloud
workloads, necessitates re-visiting the traditional rack-mountable servers based datacenter …

On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework

R Montella, G Giunta, G Laccetti, M Lapegna… - International Journal of …, 2017 - Springer
The astonishing development of diverse and different hardware platforms is twofold: on one
side, the challenge for the exascale performance for big data processing and management; …

High performance in the cloud with FPGA groups

A Iordache, G Pierre, P Sanders… - Proceedings of the 9th …, 2016 - dl.acm.org
Field-programmable gate arrays (FPGAs) can offer invaluable computational performance
for many compute-intensive algorithms. However, to justify their purchase and administration …

dCUDA: hardware supported overlap of computation and communication

T Gysi, J Bär, T Hoefler - SC'16: Proceedings of the …, 2016 - ieeexplore.ieee.org
Over the last decade, CUDA and the underlying GPU hardware architecture have
continuously gained popularity in various high-performance computing application domains …

Adaptive subcarrier nulling: Enabling partial spectrum sharing in wireless LANs

X Zhang, KG Shin - 2011 19th IEEE International Conference …, 2011 - ieeexplore.ieee.org
Emerging WLAN standards have been incorporating a variety of channel widths ranging
from 5 MHz to 160 MHz, in order to match the diverse traffic demands on different networks …

SLURM support for remote GPU virtualization: Implementation and performance study

S Iserte, A Castelló, R Mayo… - 2014 IEEE 26th …, 2014 - ieeexplore.ieee.org
SLURM is a resource manager that can be leveraged to share a collection of heterogeneous
resources among the jobs in execution in a cluster. However, SLURM is not designed to …

qCUDA: GPGPU virtualization for high bandwidth efficiency

YS Lin, CY Lin, CR Lee… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
The increasing demand for machine learning computation contributes to the convergence of
high-performance computing and cloud computing, in which the virtualization of Graphics …

Enhancing the rCUDA remote GPU virtualization framework: From a prototype to a production solution

C Reaño, F Silla, J Duato - 2017 17th IEEE/ACM International …, 2017 - ieeexplore.ieee.org
The use of hardware accelerators to increase the performance of parallel applications is
very common nowadays. For a number of reasons, however, the access to local accelerators …