In-memory grid files on graphics processors

K Yang, B He, R Fang, M Lu, N Govindaraju… - Proceedings of the 3rd …, 2007 - dl.acm.org
… , we design a massively multi-threaded GPU-based grid file for static, … grid file variant to
handle data skews efficiently. Our implementations on the NVIDIA G80 GTX graphics card are …

GFlink: An in-memory computing architecture on heterogeneous CPU-GPU clusters for big data

C Chen, K Li, A Ouyang, Z Zeng… - IEEE Transactions on …, 2018 - ieeexplore.ieee.org
… 2.1 GPGPU A Graphics Processing Unit was extended to the generalpurpose high-performance
computing area after the emergence of General Purpose GPU (GPGPU) under certain …

PIMS: A lightweight processing-in-memory accelerator for stencil computations

J Li, X Wang, A Tumeo, B Williams, JD Leidel… - Proceedings of the …, 2019 - dl.acm.org
… Our comprehensive evaluation using three different grid sizes with six … GPUs using CUDA.
In Proceedings of 2nd workshop on general purpose processing on graphics processing

A comprehensive study of in-memory computing on large HPC systems

D Huang, Z Qin, Q Liu, N Podhorszki… - 2020 IEEE 40th …, 2020 - ieeexplore.ieee.org
… of in-memory libraries between CPUs and GPUs. We found that GPU is mostly not supported
by the current in-memory … interconnects, eg, NVLink, between GPUs, we believe this will an …

iPIM: Programmable in-memory image processing accelerator using near-bank architecture

P Gu, X Xie, Y Ding, G Chen, W Zhang… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
… To validate this bottleneck, we conduct a detailed profiling of representative image processing
… To overcome the bottleneck of memory bandwidth, the 3Dstacking processing-in-memory (…

[PDF][PDF] Graphics and computing GPUs

J Nickolls, D Kirk - … /Software Interface, DA Patterson and JL …, 2009 - harmanani.github.io
… as both a programmable graphics processor and a scalable … of result data grids, partitioning
each result grid into coarse-… , and 3D texture arrays in memory via the texture subsystem. …

Grid: OneCode and FourAPIs

A Yamaguchi, P Boyle, G Cossu, G Filaci, C Lehner… - 2022 - inspirehep.net
Grid already had a parallel for construct used to target OpenMP threaded loops on multicore
… ” transformation is needed in data arrays in memory. However they semantically differ in the …

Grid: OneCode and FourAPIs

P Boyle, G Cossu, G Filaci, C Lehner, A Portelli… - arXiv preprint arXiv …, 2022 - arxiv.org
Grid already had a parallel for construct used to target OpenMP threaded loops on multicore
… ” transformation is needed in data arrays in memory. However they semantically differ in the …

[PDF][PDF] 32-bit Graphics Processing Unit

DVHP TEJA - 2016 - eescholars.iitm.ac.in
… A thread block has a block ID within its grid. A grid is an array of thread blocks that execute
… The content of resister which needs to be stored in memory is loaded into DM_Write signal …

Infinity stream: Portable and programmer-friendly in-/near-memory fusion

Z Wang, C Liu, A Arora, L John… - Proceedings of the 28th …, 2023 - dl.acm.org
… of workloads that can benefit from in-memory computation have very simple parallelism
and … in-memory commands. Further, the tDFG is a unified abstraction for near-data and inmemory