Development of a generalized model for parallel-streaming neural element and structures for scalar product calculation devices

I Tsmots, V Teslyuk, N Kryvinska, O Skorokhoda… - The Journal of …, 2023 - Springer
The Journal of Supercomputing, 2023Springer
Nowadays, intensive streams of fuzzy input data need to be processed in real-time for
different fields of science and engineering. To solve this problem, a generalized model for
the parallel-streaming neural element was developed in this paper. The proposed model
allows minimizing hardware costs while providing scalar product and activation function
calculations in real time. In particular, an algorithm and a structure for a parallel-streaming
device (PSD) were developed to calculate a scalar product with the direct formation of partial …
Abstract
Nowadays, intensive streams of fuzzy input data need to be processed in real-time for different fields of science and engineering. To solve this problem, a generalized model for the parallel-streaming neural element was developed in this paper. The proposed model allows minimizing hardware costs while providing scalar product and activation function calculations in real time. In particular, an algorithm and a structure for a parallel-streaming device (PSD) were developed to calculate a scalar product with the direct formation of partial products based on the analysis of a single bit-cut of multipliers, which provides working with the shortest conveyor stage. It is based on a modified Booth’s algorithm that allows reducing equipment costs for processing operands with high bit-width. Moreover, it promotes the lowest equipment costs for the operands with a low bit-width. Besides, researches demonstrate that the main way of increasing the speed of the developed algorithms and structures of PSD for scalar product calculating is a preliminary formation of partial products. Further, the estimation of the model parameters shows reducing conveyor steps, improvement of the locality of connections, and an increase of an adaptation to the coming data intensity. It is proposed to use the developed algorithms and structures as a basis for building devices for parallel-streaming calculation of the scalar product in real time with high efficiency of equipment use. The main ways of harmonizing the time of incoming data and weights with the conveyor cycle of the PSD for calculation of the scalar product are determined. A methodology proposed for building conveyor devices for parallel-streaming calculation of the scalar product in real time for a given intensity of input data ensures the implementation of devices with the required speed and with minimal hardware costs.
Springer
以上显示的是最相近的搜索结果。 查看全部搜索结果