A survey of CPU-GPU heterogeneous computing techniques

S Mittal, JS Vetter - ACM Computing Surveys (CSUR), 2015 - dl.acm.org
As both CPUs and GPUs become employed in a wide range of applications, it has been
acknowledged that both of these Processing Units (PUs) have their unique features and …

A survey on convolutional neural network accelerators: GPU, FPGA and ASIC

Y Hu, Y Liu, Z Liu - 2022 14th International Conference on …, 2022 - ieeexplore.ieee.org
In recent years, artificial intelligence (AI) has been under rapid development, applied in
various areas. Among a vast number of neural network (NN) models, the convolutional …

The Chinese Wall Security Policy.

DFC Brewer, MJ Nash - S&P, 1989 - facweb.iitkgp.ac.in
Everyone who has seen the movie Wall Street will have seen a commercial security policy in
action. The recent work of Clark and Wilson and the WIPCIS initiative (the Workshop on …

Efficient sparse matrix-vector multiplication on GPUs using the CSR storage format

JL Greathouse, M Daga - SC'14: Proceedings of the …, 2014 - ieeexplore.ieee.org
The performance of sparse matrix vector multiplication (SpMV) is important to computational
scientists. Compressed sparse row (CSR) is the most frequently used format to store sparse …

Exploring SIMD for molecular dynamics, using Intel® Xeon® processors and Intel® Xeon Phi coprocessors

SJ Pennycook, CJ Hughes… - 2013 IEEE 27th …, 2013 - ieeexplore.ieee.org
We analyse gather-scatter performance bottlenecks in molecular dynamics codes and the
challenges that they pose for obtaining benefits from SIMD execution. This analysis informs …

High performance and scalable radix sorting: A case study of implementing dynamic parallelism for GPU computing

D Merrill, A Grimshaw - Parallel Processing Letters, 2011 - World Scientific
The need to rank and order data is pervasive, and many algorithms are fundamentally
dependent upon sorting and partitioning operations. Prior to this work, GPU stream …

In-field classification of the asymptomatic biotrophic phase of potato late blight based on deep learning and proximal hyperspectral imaging

C Qi, M Sandroni, JC Westergaard… - … and Electronics in …, 2023 - Elsevier
Effective detection of potato late blight (PLB) is an essential aspect of potato cultivation.
However, it is a challenge to detect late blight in asymptomatic biotrophic phase in fields with …

An investigation of unified memory access performance in CUDA

R Landaverde, T Zhang, AK Coskun… - 2014 IEEE High …, 2014 - ieeexplore.ieee.org
Managing memory between the CPU and GPU is a major challenge in GPU computing. A
programming model, Unified Memory Access (UMA), has been recently introduced by Nvidia …

On the efficacy of a fused CPU+GPU processor (or APU) for parallel computing

M Daga, AM Aji, W Feng - 2011 Symposium on Application …, 2011 - ieeexplore.ieee.org
The graphics processing unit (GPU) has made significant strides as an accelerator in
parallel computing. However, because the GPU has resided out on PCIe as a discrete …

A GPU implementation of inclusion-based points-to analysis

M Mendez-Lojo, M Burtscher, K Pingali - ACM SIGPLAN Notices, 2012 - dl.acm.org
Graphics Processing Units (GPUs) have emerged as powerful accelerators for many regular
algorithms that operate on dense arrays and matrices. In contrast, we know relatively little …