Authors
Zhenhua Guo, Baoyu Fan, Yaqian Zhao, Xuelei Li, Shixin Wei, Long Li
Publication date
2018/4/8
Book
International Symposium on Applied Reconfigurable Computing
Pages
578-589
Publisher
Springer International Publishing
Description
With the development of cloud computing, the very large scale of image data has brought severe challenges for storage cost and network bandwidth in data centers. To alleviate this situation effectively, WebP has replaced the current mainstream image file format due to its better compression efficiency. In this paper, we provide an OpenCL implementation of a WebP accelerator on FPGAs to optimize the performance of the WebP lossy compression algorithm. Our accelerator uses a heavily pipelined custom hardware implementation to achieve a high throughput of ~450 MPixel/s. The performance-per-watt of our OpenCL implementation on Intel’s Arria 10 device is 8.32x better than that of a highly tuned CPU implementation on an Intel Xeon E5-2690 v3 with 24 threads. Additionally, the delay time per image can be reduced to ~90% by the data parallelism and macroblock pipelining on FPGAs …
Scholar articles
Z Guo, B Fan, Y Zhao, X Li, S Wei, L Li - International Symposium on Applied Reconfigurable …, 2018