Influence of InfiniBand FDR on the performance of remote GPU virtualization

CH Hong, I Spence, DS Nikolopoulos - ACM Computing Surveys (CSUR …, 2017 - dl.acm.org

The integration of graphics processing units (GPUs) on high-end compute nodes has
established a new accelerator-based heterogeneous computing model, which now …

被引用次数：121 相关文章所有 7 个版本

[PDF] sciencedirect.com

A complete and efficient CUDA-sharing solution for HPC clusters

AJ Pena, C Reaño, F Silla, R Mayo, ES Quintana-Ortí… - Parallel Computing, 2014 - Elsevier

In this paper we detail the key features, architectural design, and implementation of rCUDA,
an advanced framework to enable remote and transparent GPGPU acceleration in HPC …

被引用次数：136 相关文章所有 13 个版本

[PDF] soton.ac.uk

Composable architecture for rack scale big data computing

CS Li, H Franke, C Parris, B Abali, M Kesavan… - Future Generation …, 2017 - Elsevier

The rapid growth of cloud computing, both in terms of the spectrum and volume of cloud
workloads, necessitates re-visiting the traditional rack-mountable servers based datacenter …

被引用次数：71 相关文章所有 6 个版本

[PDF] bwise.kr

On the virtualization of CUDA based GPU remoting on ARM and X86 machines in the GVirtuS framework

R Montella, G Giunta, G Laccetti, M Lapegna… - International Journal of …, 2017 - Springer

The astonishing development of diverse and different hardware platforms is twofold: on one
side, the challenge for the exascale performance for big data processing and management; …

被引用次数：42 相关文章所有 11 个版本

[PDF] hal.science

High performance in the cloud with FPGA groups

A Iordache, G Pierre, P Sanders… - Proceedings of the 9th …, 2016 - dl.acm.org

Field-programmable gate arrays (FPGAs) can offer invaluable computational performance
for many compute-intensive algorithms. However, to justify their purchase and administration …

被引用次数：38 相关文章所有 7 个版本

[PDF] ethz.ch

dCUDA: hardware supported overlap of computation and communication

T Gysi, J Bär, T Hoefler - SC'16: Proceedings of the …, 2016 - ieeexplore.ieee.org

Over the last decade, CUDA and the underlying GPU hardware architecture have
continuously gained popularity in various high-performance computing application domains …

被引用次数：38 相关文章所有 31 个版本

[PDF] psu.edu

Adaptive subcarrier nulling: Enabling partial spectrum sharing in wireless LANs

X Zhang, KG Shin - 2011 19th IEEE International Conference …, 2011 - ieeexplore.ieee.org

Emerging WLAN standards have been incorporating a variety of channel widths ranging
from 5MHz to 160MHz, in order to match the diverse traffic demands on different networks …

被引用次数：48 相关文章所有 9 个版本

[PDF] upv.es

SLURM support for remote GPU virtualization: Implementation and performance study

S Iserte, A Castelló, R Mayo… - 2014 IEEE 26th …, 2014 - ieeexplore.ieee.org

SLURM is a resource manager that can be leveraged to share a collection of heterogeneous
resources among the jobs in execution in a cluster. However, SLURM is not designed to …

被引用次数：34 相关文章所有 9 个版本

[PDF] nthu.edu.tw

qCUDA: GPGPU virtualization for high bandwidth efficiency

YS Lin, CY Lin, CR Lee… - 2019 IEEE International …, 2019 - ieeexplore.ieee.org

The increasing demand for machine learning computation contributes to the convergence of
high-performance computing and cloud computing, in which the virtualization of Graphics …

被引用次数：13 相关文章所有 7 个版本

Enhancing the rCUDA remote GPU virtualization framework: From a prototype to a production solution

C Reaño, F Silla, J Duato - 2017 17th IEEE/ACM International …, 2017 - ieeexplore.ieee.org

The use of hardware accelerators to increase the performance of parallel applications is
very common nowadays. For a number of reasons, however, the access to local accelerators …

被引用次数：15 相关文章所有 3 个版本

高级搜索

QQ 群