GNNLab: a factored system for sample-based GNN training over GPUs

J Yang, D Tang, X Song, L Wang, Q Yin… - Proceedings of the …, 2022 - dl.acm.org
We propose GNNLab, a sample-based GNN training system in a single machine multi-GPU
setup. GNNLab adopts a factored design for multiple GPUs, where each GPU is dedicated to …

Large graph convolutional network training with GPU-oriented data communication architecture

SW Min, K Wu, S Huang, M Hidayetoğlu… - arXiv preprint arXiv …, 2021 - arxiv.org
Graph Convolutional Networks (GCNs) are increasingly adopted in large-scale graph-based
recommender systems. Training GCN requires the minibatch generator traversing graphs …

Smartsage: training large-scale graph neural networks using in-storage processing architectures

Y Lee, J Chung, M Rhu - Proceedings of the 49th Annual International …, 2022 - dl.acm.org
Graph neural networks (GNNs) can extract features by learning both the representation of
each objects (ie, graph nodes) and the relationship across different objects (ie, the edges …

Algorithm and system co-design for efficient subgraph-based graph representation learning

H Yin, M Zhang, Y Wang, J Wang, P Li - arXiv preprint arXiv:2202.13538, 2022 - arxiv.org
Subgraph-based graph representation learning (SGRL) has been recently proposed to deal
with some fundamental challenges encountered by canonical graph neural networks …

Graphsnapshot: Graph machine learning acceleration with fast storage and retrieval

D Liu, R Waleffe, M Jiang, S Venkataraman - arXiv preprint arXiv …, 2024 - arxiv.org
In our recent research, we have developed a framework called GraphSnapShot, which has
been proven an useful tool for graph learning acceleration. GraphSnapShot is a framework …

Machine learning and data cleaning: Which serves the other?

IF Ilyas, T Rekatsinas - ACM Journal of Data and Information Quality …, 2022 - dl.acm.org
The last few years witnessed significant advances in building automated or semi-automated
data quality, data cleaning and data integration systems powered by machine learning (ML) …

Communication-efficient graph neural networks with probabilistic neighborhood expansion analysis and caching

T Kaler, A Iliopoulos, P Murzynowski… - Proceedings of …, 2023 - proceedings.mlsys.org
Training and inference with graph neural networks (GNNs) on massive graphs in a
distributed environment has been actively studied since the inception of GNNs, owing to the …

GNN at the edge: Cost-efficient graph neural network processing over distributed edge servers

L Zeng, C Yang, P Huang, Z Zhou… - IEEE Journal on …, 2022 - ieeexplore.ieee.org
Edge intelligence has arisen as a promising computing paradigm for supporting
miscellaneous smart applications that rely on machine learning techniques. While the …

Mariusgnn: Resource-efficient out-of-core training of graph neural networks

R Waleffe, J Mohoney, T Rekatsinas… - Proceedings of the …, 2023 - dl.acm.org
We study training of Graph Neural Networks (GNNs) for large-scale graphs. We revisit the
premise of using distributed training for billion-scale graphs and show that for graphs that fit …

Saga: A platform for continuous construction and serving of knowledge at scale

IF Ilyas, T Rekatsinas, V Konda, J Pound, X Qi… - Proceedings of the …, 2022 - dl.acm.org
We introduce Saga, a next-generation knowledge construction and serving platform for
powering knowledge-based applications at industrial scale. Saga follows a hybrid batch …