查看文章

github.io 中的 [PDF]

PSACS: Highly-Parallel Shuffle Accelerator on Computational Storage.

作者

Chen Zou, Hui Zhang, Andrew A Chien, Yang-Seok Ki

发表日期

2021/10/1

研讨会论文

ICCD

页码范围

480-487

简介

Shuffle is an indispensable process in distributed online analytical processing systems to enable task-level parallelism exploitation via multiple nodes. As a data-intensive data reorganization process, shuffle implemented on general-purpose CPUs not only incurs data traffic back and forth between the computing and storage resources, but also pollutes the cache hierarchy with almost zero data reuse. As a result, shuffle can easily become the bottleneck of distributed analysis pipelines.

Our PSACS approach attacks these bottlenecks with the rising computational storage paradigm. Shuffle is offloaded to the storage-side PSACS accelerator to avoid polluting computing node memory hierarchy and enjoy the latency, bandwidth and energy benefits of near-data computing. Further, the microarchitecture of PSACS exploits data-, subtask-, and task-level parallelism for high performance and a customized scratchpad for fast on-chip random access. PSACS achieves 4.6 x− 5.7 x shuffle throughput at kernel-level and up to 1.3 x overall shuffle throughput with only a twentieth of CPU utilization comparing to software baselines. These mount up to 23% end-to-end OLAP query speedup on average.

引用总数

被引用次数：5

2022202320242 1 2

学术搜索中的文章

PSACS: Highly-Parallel Shuffle Accelerator on Computational Storage.

C Zou, H Zhang, AA Chien, YS Ki - ICCD, 2021

被引用次数：5 相关文章所有 4 个版本