GRIP: A graph neural network accelerator architecture

K Kiningham, P Levis, C Ré - IEEE Transactions on Computers, 2022 - ieeexplore.ieee.org
We present GRIP, a graph neural network accelerator architecture designed for low-latency
inference. Accelerating GNNs is challenging because they combine two distinct types of …

GraphBLAST: A high-performance linear algebra-based graph framework on the GPU

C Yang, A Buluç, JD Owens - ACM Transactions on Mathematical …, 2022 - dl.acm.org
High-performance implementations of graph algorithms are challenging to implement on
new parallel hardware such as GPUs because of three challenges:(1) the difficulty of coming …

An analysis of the graph processing landscape

ME Coimbra, AP Francisco, L Veiga - journal of Big Data, 2021 - Springer
The value of graph-based big data can be unlocked by exploring the topology and metrics of
the networks they represent, and the computational approaches to this exploration take on …

GaaS-X: Graph analytics accelerator supporting sparse data representation using crossbar architectures

N Challapalle, S Rampalli, L Song… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
Graph analytics applications are ubiquitous in this era of a connected world. These
applications have very low compute to byte-transferred ratios and exhibit poor locality, which …

Design of the GraphBLAS API for C

A Buluç, T Mattson, S McMillan… - 2017 IEEE …, 2017 - ieeexplore.ieee.org
The purpose of the GraphBLAS Forum is to standardize linear-algebraic building blocks for
graph computations. An important part of this standardization effort is to translate the …

Wholegraph: A fast graph neural network training framework with multi-gpu distributed shared memory architecture

D Yang, J Liu, J Qi, J Lai - SC22: International Conference for …, 2022 - ieeexplore.ieee.org
Graph neural networks (GNNs) are prevalent to deal with graph-structured datasets,
encoding graph data into low dimensional vectors. In this paper, we present a fast training …

Commongraph: Graph analytics on evolving data

M Afarin, C Gao, S Rahman, N Abu-Ghazaleh… - Proceedings of the 28th …, 2023 - dl.acm.org
We consider the problem of graph analytics on evolving graphs (ie, graphs that change over
time). In this scenario, a query typically needs to be applied to different snapshots of the …

Exploring data analytics without decompression on embedded GPU systems

Z Pan, F Zhang, Y Zhou, J Zhai, X Shen… - … on Parallel and …, 2021 - ieeexplore.ieee.org
With the development of computer architecture, even for embedded systems, GPU devices
can be integrated, providing outstanding performance and energy efficiency to meet the …

Abcdplace: Accelerated batch-based concurrent detailed placement on multithreaded cpus and gpus

Y Lin, W Li, J Gu, H Ren, B Khailany… - IEEE transactions on …, 2020 - ieeexplore.ieee.org
Placement is an important step in modern verylarge-scale integrated (VLSI) designs.
Detailed placement is a placement refining procedure intensively called throughout the …

SEP-graph: finding shortest execution paths for graph processing under a hybrid framework on GPU

H Wang, L Geng, R Lee, K Hou, Y Zhang… - Proceedings of the 24th …, 2019 - dl.acm.org
In general, the performance of parallel graph processing is determined by three pairs of
critical parameters, namely synchronous or asynchronous execution mode (Sync or Async) …