HeteroCore GPU to exploit TLP-resource diversity

X Zhao, Z Wang, L Eeckhout - IEEE Transactions on Parallel …, 2018 - ieeexplore.ieee.org
Graphics processing units (GPUs) are widely adopted as compute accelerators in cloud
computing environments and supercomputers. Sharing GPU resources in such …

OpenCL task partitioning in the presence of GPU contention

D Grewe, Z Wang, MFP O'Boyle - … Workshop, LCPC 2013, San Jose, CA …, 2014 - Springer
Heterogeneous multi-and many-core systems are increasingly prevalent in the desktop and
mobile domains. On these systems it is common for programs to compete with co-running …

Understanding data partition for applications on CPU-GPU integrated processors

J Fang, H Chen, J Mao - Mobile Ad-hoc and Sensor Networks: 13th …, 2018 - Springer
Integrating GPU with CPU on the same chip is increasingly common in current processor
architectures for high performance. CPU and GPU share on-chip network, last level cache …

Kernel scheduling approach for reducing GPU energy consumption

J Li, B Guo, Y Shen, D Li, Y Huang - Journal of computational science, 2018 - Elsevier
To handle the increasing data scale in broad fields, GPU (Graphic Processing Unit) is
integrated with more and more cores to provide powerful computing capability. To obtain …

CPU-assisted GPU thread pool model for dynamic task parallelism

S Zhang, T Li, Q Dong, X Liu… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
With the growing power of GPUs, how to utilize the high computing performance provided by
the GPU hardware becomes an urgent yet challenging problem, especially for applications …

Automated kernel fusion for GPU based on code motion

J Fukuhara, M Takimoto - Proceedings of the 23rd ACM SIGPLAN …, 2022 - dl.acm.org
Applications implemented for GPU are important in various fields. GPU has many parallel
computing cores and high arithmetic throughput, enabling GPU applications to work …

{Fine-Grained} Resource Sharing for Concurrent {GPGPU} Kernels

C Gregg, J Dorn, K Hazelwood, K Skadron - 4th USENIX Workshop on …, 2012 - usenix.org
General purpose GPU (GPGPU) programming frameworks such as OpenCL and CUDA
allow running individual computation kernels sequentially on a device. However, in some …

MaxPair: enhance OpenCL concurrent kernel execution by weighted maximum matching

Y Wen, MFP O'Boyle, C Fensch - Proceedings of the 11th workshop on …, 2018 - dl.acm.org
Executing multiple OpenCL kernels on the same GPU concurrently is a promising method
for improving hardware utilisation and system performance. Schemes of scheduling impact …

A model-based software solution for simultaneous multiple kernels on GPUs

H Wu, W Liu, H Lin, CL Wang - ACM Transactions on Architecture and …, 2020 - dl.acm.org
As a critical computing resource in multiuser systems such as supercomputers, data centers,
and cloud services, a GPU contains multiple compute units (CUs). GPU Multitasking is an …

Troodon: A machine-learning based load-balancing application scheduler for CPU–GPU system

YN Khalid, M Aleem, U Ahmed, MA Islam… - Journal of Parallel and …, 2019 - Elsevier
Heterogeneous computing machines consisting of a CPU and one or more GPUs are
increasingly being used today because of their higher performance-cost ratio and lower …