Distributed graph neural network training: A survey

Y Shao, H Li, X Gu, H Yin, Y Li, X Miao… - ACM Computing …, 2024 - dl.acm.org
Graph neural networks (GNNs) are a type of deep learning models that are trained on
graphs and have been successfully applied in various domains. Despite the effectiveness of …

A Survey on Concurrent Processing of Graph Analytical Queries: Systems and Algorithms

Y Li, S Sun, H Xiao, C Ye, S Lu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Graph analytical queries (GAQs) are becoming increasingly important in various domains,
including social networks, recommendation systems, and bioinformatics, among others …

Boosting Data Center Performance via Intelligently Managed Multi-backend Disaggregated Memory

J Wang, H Yang, C Li, Y Zhuansun… - … Conference for High …, 2024 - ieeexplore.ieee.org
Existing disaggregated memory (DM) systems face a problem of underutilized far memory
bandwidth, which greatly limits the data throughput when processing data-intensive …

Noswalker: A decoupled architecture for out-of-core random walk processing

S Wang, M Zhang, K Yang, K Chen, S Ma… - Proceedings of the 28th …, 2023 - dl.acm.org
Out-of-core random walk system has recently attracted a lot of attention as an economical
way to run billions of walkers over large graphs. However, existing out-of-core random walk …

Scalable graph sampling on gpus with compressed graph

H Yin, Y Shao, X Miao, Y Li, B Cui - Proceedings of the 31st ACM …, 2022 - dl.acm.org
GPU is a powerful accelerator for parallel computation. Graph sampling is a fundamental
technology for large-scale graph analysis and learning. To accelerate graph sampling using …

gsword: Gpu-accelerated sampling for subgraph counting

C Ye, Y Li, S Sun, W Guo - Proceedings of the ACM on Management of …, 2024 - dl.acm.org
Subgraph counting is a fundamental component for many downstream applications such as
graph representation learning and query optimization. Since obtaining the exact count is …

Excavating the potential of graph workload on rdma-based far memory architecture

J Wang, C Li, T Wang, L Zhang, P Wang… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Disaggregated architecture brings new opportunities to memory-consuming applications like
graph processing. It allows one to outspread memory access pressure from local to far …

gsampler: General and efficient gpu-based graph sampling for graph learning

P Gong, R Liu, Z Mao, Z Cai, X Yan, C Li… - Proceedings of the 29th …, 2023 - dl.acm.org
Graph sampling prepares training samples for graph learning and can dominate the training
time. Due to the increasing algorithm diversity and complexity, existing sampling frameworks …

Fargraph+: Excavating the parallelism of graph processing workload on RDMA-based far memory system

J Wang, C Li, Y Liu, T Wang, J Mei, L Zhang… - Journal of Parallel and …, 2023 - Elsevier
Disaggregated architecture brings new opportunities to memory-consuming applications like
graph processing. It allows one to outspread memory access pressure from local to far …

Enhancing High-Throughput GPU Random Walks Through Multi-Task Concurrency Orchestration

C Xu, C Li, X Hou, J Mei, J Wang, P Wang… - ACM Transactions on …, 2025 - dl.acm.org
Random walk is a powerful tool for large-scale graph learning, but its high computational
demand presents a challenge. While GPUs can accelerate random walk tasks, current …