[HTML][HTML] A survey on hardware accelerators: Taxonomy, trends, challenges, and perspectives

B Peccerillo, M Mannino, A Mondelli… - Journal of Systems …, 2022 - Elsevier
In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …

Demystifying graph databases: Analysis and taxonomy of data organization, system designs, and graph queries

M Besta, R Gerstenberger, E Peter, M Fischer… - ACM Computing …, 2023 - dl.acm.org
Numerous irregular graph datasets, for example social networks or web graphs, may contain
even trillions of edges. Often, their structure changes over time and they have domain …

Hygcn: A gcn accelerator with hybrid architecture

M Yan, L Deng, X Hu, L Liang, Y Feng… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Inspired by the great success of neural networks, graph convolutional neural networks
(GCNs) are proposed to analyze graph data. GCNs mainly include two phases with distinct …

Gemini: A {Computation-Centric} distributed graph processing system

X Zhu, W Chen, W Zheng, X Ma - 12th USENIX Symposium on Operating …, 2016 - usenix.org
Traditionally distributed graph processing systems have largely focused on scalability
through the optimizations of inter-node communication and load balance. However, they …

Graphicionado: A high-performance and energy-efficient accelerator for graph analytics

TJ Ham, L Wu, N Sundaram, N Satish… - 2016 49th annual …, 2016 - ieeexplore.ieee.org
Graphs are one of the key data structures for many real-world computing applications and
the importance of graph analytics is ever-growing. While existing software graph processing …

The GAP benchmark suite

S Beamer, K Asanović, D Patterson - arXiv preprint arXiv:1508.03619, 2015 - arxiv.org
We present a graph processing benchmark suite with the goal of helping to standardize
graph processing evaluations. Fewer differences between graph processing evaluations will …

Outerspace: An outer product based sparse matrix multiplication accelerator

S Pal, J Beaumont, DH Park… - … Symposium on High …, 2018 - ieeexplore.ieee.org
Sparse matrices are widely used in graph and data analytics, machine learning, engineering
and scientific applications. This paper describes and analyzes OuterSPACE, an accelerator …

Sisa: Set-centric instruction set architecture for graph mining on processing-in-memory systems

M Besta, R Kanakagiri, G Kwasniewski… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …

DAMOV: A new methodology and benchmark suite for evaluating data movement bottlenecks

GF Oliveira, J Gómez-Luna, L Orosa, S Ghose… - IEEE …, 2021 - ieeexplore.ieee.org
Data movement between the CPU and main memory is a first-order obstacle against improv
ing performance, scalability, and energy efficiency in modern systems. Computer systems …

Graphit: A high-performance graph dsl

Y Zhang, M Yang, R Baghdadi, S Kamil… - Proceedings of the …, 2018 - dl.acm.org
The performance bottlenecks of graph applications depend not only on the algorithm and
the underlying hardware, but also on the size and structure of the input graph. As a result …