Parallel 2D FFT implementation on FPGA suitable for real-time MR image processing

L Li, AM Wyrwicz - Review of Scientific Instruments, 2018 - pubs.aip.org
We report the design and implementation of a parallel two-dimensional fast Fourier
transform (2D FFT) algorithm on a Field Programmable Gate Array (FPGA) for real-time MR …

A parallel memory system for variable block-size motion estimation algorithms

J Vanne, E Aho, TD Hamalainen… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
This paper proposes an efficient parallel memory system for algorithms applied in fixed and
variable block-size motion estimation (VBSME). The proposed system is implemented by a …

Improved SIMD architecture for high performance video processors

WY Lo, DPK Lun, WC Siu, W Wang… - IEEE transactions on …, 2011 - ieeexplore.ieee.org
Single instruction multiple data (SIMD) execution is in no doubt an efficient way to exploit the
data level parallelism in image and video applications. However, SIMD execution …

Memory organization with multi-pattern parallel accesses

A Vitkovski, G Kuzmanov, G Gaydadjiev - Proceedings of the conference …, 2008 - dl.acm.org
We propose an interleaved memory organization supporting multi-pattern parallel accesses
in two-dimensional (2D) addressing space. Our proposal targets computing systems with …

An access-pattern-aware on-chip vector memory system with automatic loading for SIMD architectures

T Geng, E Diken, T Wang, L Jozwiak… - 2018 IEEE High …, 2018 - ieeexplore.ieee.org
Single-Instruction-Multiple-Data (SIMD) architectures are widely used to accelerate
applications involving Data-Level Parallelism (DLP); the on-chip memory system facilitates …

New access modes of parallel memory subsystem for sub-pixel motion estimation

R Jakovljević, A Berić, E Van Dalen… - Journal of Real-Time …, 2018 - Springer
Accessing pixels in memory is a well-known bottleneck of SIMD (single instruction multiple
data) processors in video/imaging. To tackle it, we propose new block and row access …

Towards application-centric parallel memories

G Stramondo, CB Ciobanu, AL Varbanescu… - Euro-Par 2018: Parallel …, 2019 - Springer
Many applications running on parallel processors and accelerators are bandwidth bound. In
this work, we explore the benefits of parallel (scratch-pad) memories to further accelerate …

Configurable data memory for multimedia processing

E Aho, J Vanne, TD HÄmÄlÄinen - Journal of Signal Processing Systems, 2008 - Springer
In modern multimedia applications, memory bottleneck can be alleviated with special stride
data accesses. Data elements in stride access can be retrieved in parallel with parallel …

High-bandwidth address generation unit

C Galuzzi, C Gou, H Calderón, GN Gaydadjiev… - Journal of Signal …, 2009 - Springer
In this paper we present an efficient data fetch circuitry to retrieve several operands from an-
way parallel memory system in a single machine cycle. The proposed address generation …

[PDF][PDF] 面向多兴趣区域图像处理应用的高效无冲突并行访问存储模型

徐金波, 窦勇 - 计算机学报, 2008 - cjc.ict.ac.cn
摘要针对不规则数据访问模式图像处理应用提出了一种通用的高效无冲突并行访问存储模型.
在主存储器与处理器之间构建了一种多体存储结构, 并将大部分的不规则数据访问模式归类为对 …