CRUM: Checkpoint-restart support for CUDA's unified memory

R Garg, A Mohan, M Sullivan… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
Unified Virtual Memory (UVM) was recently introduced with CUDA version 8 and the Pascal
GPU. The older CUDA programming style is akin to older large-memory UNIX applications …

Case: A compiler-assisted scheduling framework for multi-gpu systems

C Chen, C Porter, S Pande - Proceedings of the 27th ACM SIGPLAN …, 2022 - dl.acm.org
Modern computing platforms tend to deploy multiple GPUs on a single node to boost
performance. GPUs have large computing capacities and are an expensive resource …

Accelerator virtualization in fog computing: Moving from the cloud to the edge

B Varghese, C Reano, F Silla - IEEE Cloud Computing, 2018 - ieeexplore.ieee.org
Hardware accelerators are available on the cloud for enhanced analytics. Next-generation
clouds aim to bring enhanced analytics using accelerators closer to user devices at the edge …

Distributed configuration, authorization and management in the cloud-based internet of things

M Henze, B Wolters, R Matzutt… - 2017 IEEE Trustcom …, 2017 - ieeexplore.ieee.org
Network-based deployments within the Internet of Things increasingly rely on the cloud-
controlled federation of individual networks to configure, authorize, and manage devices …

Hierarchical Resource Partitioning on Modern GPUs: A Reinforcement Learning Approach

U Saroliya, E Arima, D Liu… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
GPU-based heterogeneous architectures are now commonly used in HPC clusters. Due to
their architectural simplicity specialized for data-level parallelism, GPUs can offer much …

cCUDA: Effective co-scheduling of concurrent kernels on GPUs

SK Shekofteh, H Noori, M Naghibzadeh… - … on Parallel and …, 2019 - ieeexplore.ieee.org
While GPUs are meantime omnipresent for many scientific and technical computations, they
still continue to evolve as processors. An important recent feature is the ability to execute …

Optimizing hardware resource partitioning and job allocations on modern gpus under power caps

E Arima, M Kang, I Saba, J Weidendorfer… - … Proceedings of the …, 2022 - dl.acm.org
CPU-GPU heterogeneous systems are now commonly used in HPC (High-Performance
Computing). However, improving the utilization and energy-efficiency of such systems is still …

Qos-aware scheduling of remote rendering for interactive multimedia applications in edge computing

R Xie, J Fang, J Yao, K Liu, X Jia… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Leveraging emerging edge computing and 5G networks, researchers proposed to offload
the 3D rendering of interactive multimedia applications (eg, virtual reality and cloud gaming) …

Optimizing on-demand gpus in the cloud for deep learning applications training

A Jahani, M Lattuada, M Ciavotta… - … and Security (ICCCS …, 2019 - ieeexplore.ieee.org
Deep learning (DL) methods have recently gained popularity and been used in
commonplace applications; voice and face recognition, among the others. Despite the …

A new preemptive task scheduling framework for heterogeneous embedded systems

JA Ayala-Barbosa, PE Mendez-Monroy - Proceedings of the 2022 8th …, 2022 - dl.acm.org
In recent years, the graphics processing units (GPUs) have been used to generate real-time
applications in embedded systems; due to the programmability, high performance, and low …