CPU-assisted GPU thread pool model for dynamic task parallelism

S Zhang, T Li, Q Dong, X Liu… - 2015 IEEE International …, 2015 - ieeexplore.ieee.org
With the growing power of GPUs, how to utilize the high computing performance provided by
the GPU hardware becomes an urgent yet challenging problem, especially for applications …

Reducing GPU offload latency via fine-grained CPU-GPU synchronization

D Lustig, M Martonosi - 2013 IEEE 19th International …, 2013 - ieeexplore.ieee.org
GPUs are seeing increasingly widespread use for general purpose computation due to their
excellent performance for highly-parallel, throughput-oriented applications. For many …

Understanding the impact of fine-grained data sharing and thread communication on heterogeneous workload development

T Ta, D Troendle, X Hu, B Jang - 2017 16th International …, 2017 - ieeexplore.ieee.org
The conventional OpenCL 1. x style CPU-GPU heterogeneous computing paradigm treats
the CPU and GPU processors as loosely connected separate entities. At best each executes …

Evaluating the effect of last-level cache sharing on integrated GPU-CPU systems with heterogeneous applications

V Garcıa, J Gomez-Luna, T Grass… - 2016 IEEE …, 2016 - ieeexplore.ieee.org
Heterogeneous systems are ubiquitous in the field of High-Performance Computing (HPC).
Graphics processing units (GPUs) are widely used as accelerators for their enormous …

A hybrid shared memory heterogeneous execution platform for PCIe-based GPGPUs

SK Shukla, LN Bhuyan - 20th Annual International Conference …, 2013 - ieeexplore.ieee.org
The disparity between the CPU and GPU domains has forced the programmers to adhere to
the traditional driver-based GPU programming approach. The negative implications of this …

A dynamic special-purpose scheduler for concurrent kernels on gpu

R Mohammadi, SK Shekofieh… - … on Computer and …, 2016 - ieeexplore.ieee.org
GPUs are widely used as powerful accelerators for data-parallel applications such as
financial and scientific applications in industrial and scientific areas. Effective scheduling of …

Dynamic sharing of GPUs in cloud systems

KM Diab, MM Rafique… - 2013 IEEE International …, 2013 - ieeexplore.ieee.org
The use of computational accelerators, specifically programmable GPUs, is becoming
popular in cloud computing environments. Cloud vendors currently provide GPUs as …

A performance prediction model for memory-intensive GPU kernels

Z Hu, G Liu - 2014 IEEE Symposium on Computer Applications …, 2014 - ieeexplore.ieee.org
Commodity graphic processing units (GPUs) have rapidly evolved to become high
performance accelerators for data-parallel computing through a large array of processing …

[PDF][PDF] Towards multi-tenant GPGPU: Event-driven programming model for system-wide scheduling on shared GPUs

Y Suzuki, H Yamada, S Kato, K Kono - Proceedings of the Workshop …, 2016 - cs.utexas.edu
Graphics processing units (GPUs) are attractive to the generalpurpose computing (GPGPU)
beyond the graphics purpose. Sharing GPUs among such GPGPU applications is a key …

Dynamic Task Scheduling Scheme for a GPGPU Programming Framework

K Ohno, R Yamamoto - 2015 Third International Symposium on …, 2015 - ieeexplore.ieee.org
The computational power and the physical memory size of a single GPU device are often
insufficient for large-scale problems. Using CUDA, the user must explicitly partition such …