作者
Song Sun, Joseph Zambreno
发表日期
2009/12/9
研讨会论文
Proceedings of the International Conference on Field-Programmable Technology (FPT)
简介
A floating-point accumulator for FPGA-based high performance computing applications is proposed and evaluated. Compared to previous work, our accumulator uses a fixed size circuit, and can reduce an arbitrary number of input sets of varying sizes without requiring prior knowledge of the bounds of summands. In this paper, we describe how the adder accumulator operator can be heavily pipelined to achieve a high clock speed when mapped to FPGA technology, while still maintaining the original input ordering. Our experimental results show that our accumulator design is very competitive with previous efforts in terms of FPGA resource usage and clock frequency, making it an ideal building block for large-scale sparse matrix computations as implemented in FPGA-based high performance computing systems.
引用总数
201020112012201320142015201620172018201920202021202220232338425423323
学术搜索中的文章