KiloCore: A 32-nm 1000-processor computational array

B Bohnenstiehl, A Stillmaker… - IEEE Journal of Solid …, 2017 - ieeexplore.ieee.org
A processor array containing 1000 independent processors and 12 memory modules was
fabricated in 32-nm partially depleted silicon on insulator CMOS. The programmable …

Hybrid hardware/software floating-point implementations for optimized area and throughput tradeoffs

JJ Pimentel, B Bohnenstiehl… - IEEE Transactions on …, 2016 - ieeexplore.ieee.org
Hybrid floating-point (FP) implementations improve software FP performance without
incurring the area overhead of full hardware FP units. The proposed implementations are …

Efficient emulation of floating-point arithmetic on fixed-point SIMD processors

L Gerlach, G Payá-Vayá… - 2016 IEEE International …, 2016 - ieeexplore.ieee.org
In this paper, a software floating-point emulation library for fixed-point SIMD processors is
proposed. The single instruction multiple data (SIMD) mechanism of those processors is …

Area efficient backprojection computation with reduced floating-point word width for SAR image formation

JJ Pimentel, A Stillmaker… - 2015 49th Asilomar …, 2015 - ieeexplore.ieee.org
The widths of data words in digital processors have a direct impact on area in application-
specific ICs (ASICs) and FPGAs. Circuit area impacts energy dissipation per workload and …

[图书][B] Design and programming of the KiloCore processor arrays

B Bohnenstiehl - 2020 - search.proquest.com
Modern semiconductor fabrication technologies now enable the construction of integrated
circuits which contain over 1000 processors on a single chip. However, for such systems to …

[图书][B] Sparse matrix multiplication on a many-core platform

P Shi - 2018 - vcl.ece.ucdavis.edu
Sparse matrix-vector multiplication (SpMV) is a critical operation in scientific computing and
engineering applications. This thesis explores implementing SpMV kernels on a many-core …

[图书][B] AlexNet deep neural network on a many core platform

FE Borges - 2019 - search.proquest.com
Deep neural networks are used in many engineering applications such as autonomous
driving, image recognition, natural language processing, etc. For real time applications, low …

[图书][B] Design of energy-efficient many-core MIMD GALS processor arrays in the 1000-processor era

AT Stillmaker - 2015 - search.proquest.com
As transistor sizes continue to scale, more transistors are able to be used in a fixed die size.
The recent trend for general purpose processing units is to use the increased number of …

[图书][B] Methods for reducing floating-point computation overhead

J Pimentel - 2017 - search.proquest.com
Despite floating-point (FP) being the most commonly used method for real number
representation, certain architectures are still limited to fixed-point arithmetic due to the large …

KAVUAKA: a low-power application-specific processor architecture for digital hearing aids

L Gerlach - 2021 - repo.uni-hannover.de
The power consumption of digital hearing aids is very restricted due to their small physical
size and the available hardware resources for signal processing are limited. However, there …