SISA: Set-centric instruction set architecture for graph mining on processing-in-memory systems

M Besta, R Kanakagiri, G Kwasniewski… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Simple graph algorithms such as PageRank have been the target of numerous hardware
accelerators. Yet, there also exist much more complex graph mining algorithms for problems …

Prodigy: Improving the memory latency of data-indirect irregular workloads using hardware-software co-design

N Talati, K May, A Behroozi, Y Yang… - … Symposium on High …, 2021 - ieeexplore.ieee.org
Irregular workloads are typically bottlenecked by the memory system. These workloads often
use sparse data representations, e.g., compressed sparse row/column (CSR/CSC), to …

PHI: Architectural support for synchronization- and bandwidth-efficient commutative scatter updates

A Mukkara, N Beckmann, D Sanchez - … of the 52nd Annual IEEE/ACM …, 2019 - dl.acm.org
Many applications perform frequent scatter update operations to large data structures. For
example, in push-style graph algorithms, processing each vertex requires updating the data …

Betty: Enabling large-scale GNN training with batch-level graph partitioning

S Yang, M Zhang, W Dong, D Li - Proceedings of the 28th ACM …, 2023 - dl.acm.org
The Graph Neural Network (GNN) is showing outstanding results in improving the
performance of graph-based applications. Recent studies demonstrate that GNN …

NDMiner: Accelerating graph pattern mining using near data processing

N Talati, H Ye, Y Yang, L Belayneh, KY Chen… - Proceedings of the 49th …, 2022 - dl.acm.org
Graph Pattern Mining (GPM) algorithms mine structural patterns in graphs. The performance
of GPM workloads is bottlenecked by control flow and memory stalls. This is because of data …

P-OPT: Practical optimal cache replacement for graph analytics

V Balaji, N Crago, A Jaleel… - 2021 IEEE International …, 2021 - ieeexplore.ieee.org
Graph analytics is an important workload that achieves suboptimal performance due to poor
cache locality. State-of-the-art cache replacement policies fail to capture the highly dynamic …

SpZip: Architectural support for effective data compression in irregular applications

Y Yang, JS Emer, D Sanchez - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Irregular applications, such as graph analytics and sparse linear algebra, exhibit frequent
indirect, data-dependent accesses to single or short sequences of elements that cause high …

Speeding up SpMV for power-law graph analytics by enhancing locality & vectorization

S Yesil, A Heidarshenas, A Morrison… - … Conference for High …, 2020 - ieeexplore.ieee.org
Graph analytics applications often target large-scale web and social networks, which are
typically power-law graphs. Graph algorithms can often be recast as generalized Sparse …

Large-Scale Graph Processing on Emerging Storage Devices

N Elyasi, C Choi, A Sivasubramaniam - 17th USENIX Conference on File …, 2019 - usenix.org
Graph processing is becoming commonplace in many applications to analyze huge
datasets. Much of the prior work in this area has assumed I/O devices with considerable …

DPU: DAG processing unit for irregular graphs with precision-scalable posit arithmetic in 28 nm

N Shah, LIG Olascoaga, S Zhao… - IEEE Journal of Solid …, 2021 - ieeexplore.ieee.org
Computation in several real-world applications such as probabilistic machine learning,
sparse linear algebra, and robotic navigation can be modeled as irregular directed acyclic …