GPU virtualization and scheduling methods: A comprehensive survey

CH Hong, I Spence, DS Nikolopoulos - ACM Computing Surveys (CSUR …, 2017 - dl.acm.org
The integration of graphics processing units (GPUs) on high-end compute nodes has
established a new accelerator-based heterogeneous computing model, which now …

Telekine: Secure computing with cloud GPUs

T Hunt, Z Jia, V Miller, A Szekely, Y Hu… - … USENIX Symposium on …, 2020 - usenix.org
GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …

MASK: Redesigning the GPU memory hierarchy to support multi-application concurrency

R Ausavarungnirun, V Miller, J Landgraf… - ACM SIGPLAN …, 2018 - dl.acm.org
Graphics Processing Units (GPUs) exploit large amounts of thread-level parallelism to
provide high instruction throughput and to efficiently hide long-latency stalls. The resulting …

DGSF: Disaggregated GPUs for serverless functions

H Fingler, Z Zhu, E Yoon, Z Jia… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Ease of use and transparent access to elastic resources have attracted many applications
away from traditional platforms toward serverless functions. Many of these applications, such …

Accelerated serverless computing based on GPU virtualization

DM Naranjo, S Risco, C de Alfonso, A Pérez… - Journal of Parallel and …, 2020 - Elsevier
This paper introduces a platform to support serverless computing for scalable event-driven
data processing that features a multi-level elasticity approach combined with virtualization of …

State‐of‐the‐Art Report in Web‐based Visualization

F Mwalongo, M Krone, G Reina… - Computer graphics …, 2016 - Wiley Online Library
In this report, we review the current state of the art of web‐based visualization applications.
Recently, an increasing number of web‐based visualization applications have emerged …

AvA: Accelerated virtualization of accelerators

H Yu, AM Peters, A Akshintala… - Proceedings of the Twenty …, 2020 - dl.acm.org
Applications are migrating en masse to the cloud, while accelerators such as GPUs, TPUs,
and FPGAs proliferate in the wake of Moore's Law. These trends are in conflict: cloud …

Characterizing Power Management Opportunities for LLMs in the Cloud

P Patel, E Choukse, C Zhang, Í Goiri, B Warrier… - Proceedings of the 29th …, 2024 - dl.acm.org
Recent innovation in large language models (LLMs), and their myriad use cases have
rapidly driven up the compute demand for datacenter GPUs. Several cloud providers and …

VADI: GPU virtualization for an automotive platform

C Lee, SW Kim, C Yoo - IEEE Transactions on Industrial …, 2015 - ieeexplore.ieee.org
Modern vehicles are evolving with more electronic components than ever before (In this
paper, “vehicle” means “automotive vehicle.” It is also equal to “car.”) One notable example is …

Disaggregated GPU Acceleration for Serverless Applications

H Fingler, Z Zhu, E Yoon, Z Jia, E Witchel… - ACM SIGOPS …, 2023 - dl.acm.org
Serverless platforms have been attracting applications from traditional platforms because
infrastructure management responsibilities are shifted from users to providers. Many …