J Vanne, E Aho, TD Hamalainen… - IEEE Transactions on …, 2008 - ieeexplore.ieee.org
This paper proposes an efficient parallel memory system for algorithms applied in fixed and variable block-size motion estimation (VBSME). The proposed system is implemented by a …
WY Lo, DPK Lun, WC Siu, W Wang… - IEEE transactions on …, 2011 - ieeexplore.ieee.org
Single instruction multiple data (SIMD) execution is in no doubt an efficient way to exploit the data level parallelism in image and video applications. However, SIMD execution …
We propose an interleaved memory organization supporting multi-pattern parallel accesses in two-dimensional (2D) addressing space. Our proposal targets computing systems with …
Single-Instruction-Multiple-Data (SIMD) architectures are widely used to accelerate applications involving Data-Level Parallelism (DLP); the on-chip memory system facilitates …
R Jakovljević, A Berić, E Van Dalen… - Journal of Real-Time …, 2018 - Springer
Accessing pixels in memory is a well-known bottleneck of SIMD (single instruction multiple data) processors in video/imaging. To tackle it, we propose new block and row access …
Many applications running on parallel processors and accelerators are bandwidth bound. In this work, we explore the benefits of parallel (scratch-pad) memories to further accelerate …
E Aho, J Vanne, TD HÄmÄlÄinen - Journal of Signal Processing Systems, 2008 - Springer
In modern multimedia applications, memory bottleneck can be alleviated with special stride data accesses. Data elements in stride access can be retrieved in parallel with parallel …
In this paper we present an efficient data fetch circuitry to retrieve several operands from an- way parallel memory system in a single machine cycle. The proposed address generation …