Hardware n choose k counters with applications to the partial exhaustive search

K Nakano, Y Yamagishi - IEICE transactions on information and …, 2005 - search.ieice.org
The main contribution of this work is to present several hardware implementations of an “n
choose k” counter (C (n, k) counter for short), which lists all n-bit numbers with (n− k) 0's and …

Faster optimal parallel prefix circuits: New algorithmic construction

YC Lin, CY Su - Journal of Parallel and Distributed Computing, 2005 - Elsevier
Parallel prefix circuits are parallel prefix algorithms on the combinational circuit model. A
prefix circuit with n inputs is depth-size optimal if its depth plus size equals 2n-2. Smaller …

A new approach to constructing optimal parallel prefix circuits with small depth

YC Lin, JW Hsiao - Journal of Parallel and Distributed Computing, 2004 - Elsevier
Parallel prefix circuits are parallel algorithms performing the prefix operation for the
combinational circuit model. The size of a prefix circuit is the number of operation nodes in …

Accelerating montgomery modulo multiplication for redundant radix-64k number system on the FPGA using dual-port block RAMs

K Shigemoto, K Kawakami… - 2008 IEEE/IFIP …, 2008 - ieeexplore.ieee.org
The main contribution of this paper is to present hardware algorithms for redundant radix-2 r
number system in the FPGA to accelerate Montgomery modulo multiplication with many bits …

FM screening by the local exhaustive search, with hardware acceleration

Y Ito, K Nakano - International Journal of Foundations of Computer …, 2005 - World Scientific
The main contribution of this paper is to show a new approach for FM screening which we
call Local Exhaustive Search (LES) method, and to present ways to accelerate the …

A tiny processing system for education and small embedded systems on the FPGAs

K Nakano, K Kawakami, K Shigemoto… - 2008 IEEE/IFIP …, 2008 - ieeexplore.ieee.org
The main contribution of this paper is to present a simple, scalable, and portable tiny
processing system which can be implemented in various FPGAs. Our processing system …

A new FM screening method to generate cluster-dot binary images using the local exhaustive search with FPGA acceleration

Y Ito, K Nakano - International Journal of Foundations of Computer …, 2008 - World Scientific
Screening is an important task to convert a continuous-tone image into a binary image with
pure black and white pixels. The main contribution of this paper is to show a new algorithm …

Z4: A new depth-size optimal parallel prefix circuit with small depth

YC Lin, JN Chen - Neural, Parallel & Scientific Computations, 2003 - dl.acm.org
Parallel prefix algorithms for the combinational circuit model are called parallel prefix
circuits. An n-input prefix circuit with depth d and size s is depth-size optimal if d+ s= 2 n-2 …

Fast problem-size-independent parallel prefix circuits

YC Lin, LL Hung - Journal of Parallel and Distributed Computing, 2009 - Elsevier
A family of parallel algorithms solving the prefix problem on the combinational circuit model
is presented. These prefix circuits are waist-size optimal with waist 1 (WSO-1). They are not …

Aggregating over Dominated Points by Sorting, Scanning, Zip and Flat Maps

J Sroka, J Tyszkiewicz - arXiv preprint arXiv:2305.16751, 2023 - arxiv.org
Prefix aggregation operation (also called scan), and its particular case, prefix summation, is
an important parallel primitive and enjoys a lot of attention in the research literature. It is also …