Oversubscribing gpu unified virtual memory: Implications and suggestions

C Shao, J Guo, P Wang, J Wang, C Li… - … of the 2022 ACM/SPEC on …, 2022 - dl.acm.org
Recent GPU architectures support unified virtual memory (UVM), which offers great
opportunities to solve larger problems by memory oversubscription. Although some studies …

Skywalker: Efficient alias-method-based graph sampling and random walk on gpus

P Wang, C Li, J Wang, T Wang, L Zhang… - 2021 30th …, 2021 - ieeexplore.ieee.org
Graph sampling and random walk operations, capturing the structural properties of graphs,
are playing an important role today as we cannot directly adopt computing-intensive …

CGgraph: An Ultra-fast Graph Processing System on Modern Commodity CPU-GPU Co-processor

P Cui, H Liu, B Tang, Y Yuan - Proceedings of the VLDB Endowment, 2024 - dl.acm.org
In recent years, many CPU-GPU heterogeneous graph processing systems have been
developed in both academic and industrial to facilitate large-scale graph processing in …

Boosting Data Center Performance via Intelligently Managed Multi-backend Disaggregated Memory

J Wang, H Yang, C Li, Y Zhuansun… - … Conference for High …, 2024 - ieeexplore.ieee.org
Existing disaggregated memory (DM) systems face a problem of underutilized far memory
bandwidth, which greatly limits the data throughput when processing data-intensive …

HyTGraph: GPU-Accelerated Graph Processing with Hybrid Transfer Management

Q Wang, X Ai, Y Zhang, J Chen… - 2023 IEEE 39th …, 2023 - ieeexplore.ieee.org
Processing large graphs with memory-limited GPU needs to resolve issues of host-GPU
data transfer, which is a key performance bottleneck. Existing GPU-accelerated graph …

GRIT: Enhancing Multi-GPU Performance with Fine-Grained Dynamic Page Placement

Y Wang, B Li, A Jaleel, J Yang… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Multi-GPU systems have become popular to cater to the growing demands for high
parallelism and large memory capacity. However, the delivered performance is constrained …

Excavating the potential of graph workload on rdma-based far memory architecture

J Wang, C Li, T Wang, L Zhang, P Wang… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Disaggregated architecture brings new opportunities to memory-consuming applications like
graph processing. It allows one to outspread memory access pressure from local to far …

Adaptive update handling for graph HTAP

MA Jibril, A Baumstark, KU Sattler - Distributed and Parallel Databases, 2023 - Springer
Hybrid transactional/analytical processing (HTAP) workloads on graph data can significantly
benefit from GPU accelerators. However, to exploit the full potential of GPU processing …

HEGrid: A high efficient multi-channel radio astronomical data gridding framework in heterogeneous computing environments

H Wang, C Yu, J Xiao, S Tang, M Long… - Future Generation …, 2023 - Elsevier
The challenge to fully exploit the potential of existing and upcoming scientific instruments
like large single-dish radio telescopes is to process the collected massive data effectively …

Optimizing GPU-Based Graph Sampling and Random Walk for Efficiency and Scalability

P Wang, C Xu, C Li, J Wang, T Wang… - IEEE Transactions …, 2023 - ieeexplore.ieee.org
Graph sampling and random walk algorithms are playing increasingly important roles today
because they can significantly reduce graph size while preserving structural information …