作者
Arnab A Purkayastha, Sai Raghavendran, Jhanani Thiagarajan, Hamed Tabkhi
发表日期
2019/9/24
研讨会论文
2019 IEEE High Performance Extreme Computing Conference (HPEC)
页码范围
1-7
出版商
IEEE
简介
OpenCL programming ability combined with OpenCL High-Level Synthesis (OpenCL-HLS) tools have made tremendous improvements in the reconfigurable computing field. FPGAs inherent pipelined parallelism capability provides not only faster execution times but also power-efficient solutions when executing massively parallel applications. A major execution bottleneck affecting FPGA performance is the high number of memory stalls exposed to pipelined data-path that hinders the benefits of data-path customization.This paper explores the efficiency of “OpenCL Pipe” to hide memory access latency on cloud FPGAs by decoupling memory access from computation. The Pipe semantic is leveraged to split OpenCL kernels into “read”, “compute” and “write back” sub-kernels which work concurrently to overlap the computation of current threads with the memory access of future threads. For evaluation, we use a mix …
引用总数
2021202220232024221
学术搜索中的文章
AA Purkayastha, S Raghavendran, J Thiagarajan… - 2019 IEEE High Performance Extreme Computing …, 2019