Mixed-precision neural-processing unit tile

I Ovsiannikov - US Patent App. 16/847,504, 2020 - Google Patents
A processor. In some embodiments, the processor includes: a first tile, the first tile being
configured: to feed a first nibble from a third queue, through a first shuffler, to a first multi …

Homogenizing data sparsity using a butterfly multiplexer

I Ovsiannikov, L Wang - US Patent 11,664,819, 2023 - Google Patents
US11664819B2 - Homogenizing data sparsity using a butterfly multiplexer - Google Patents
US11664819B2 - Homogenizing data sparsity using a butterfly multiplexer - Google Patents …

Accelerating 2D convolutional layer mapping on a dot product architecture

AS Ardestani, J Hassoun - US Patent 12,112,141, 2024 - Google Patents
A method for performing a convolution operation includes storing, a convolution kernel in a
first storage device, the convolution kernel having dimensions x by y; storing, in a second …

Neural processor

I Ovsiannikov, AS Ardestani, JH Hassoun… - US Patent …, 2024 - Google Patents
A neural processor. In some embodiments, the processor includes a first tile, a second tile, a
memory, and a bus. The bus may be connected to the memory, the first tile, and the second …

Neural processor

I Ovsiannikov, AS Ardestani, JH Hassoun… - US Patent …, 2024 - Google Patents
A neural processor. In some embodiments, the processor includes a first tile, a second tile, a
memory, and a bus. The bus may be connected to the memory, the first tile, and the second …

Methods, apparatus, and articles of manufacture to increase data reuse for multiply and accumulate (mac) operations

N Hanrahan, M Power, K Brady, MT Grymel… - US Patent App. 18 …, 2024 - Google Patents
US20240036763A1 - Methods, apparatus, and articles of manufacture to increase data reuse
for multiply and accumulate (mac) operations - Google Patents US20240036763A1 - Methods …

Hardware channel-parallel data compression/decompression

I Ovsiannikov, AS Ardestani, L Wang… - US Patent …, 2024 - Google Patents
A multichannel data packer includes a plurality of two-input multiplexers and a controller.
The plurality of two-input multiplexers is arranged in 2 N rows and N columns in which N is …

Image recognition method, image recognition apparatus, and non-transitory computer readable recording medium storing an image recognition program

T Miyamoto, K Tanaka, K Morimoto… - US Patent App. 17 …, 2022 - Google Patents
An image recognition method includes a feature amount extracting step of generating, from
an input image, a base feature map group including a plurality of base feature maps; an …

Compute-in-memory deep neural network inference engine using low-rank approximation technique

TT Hoang, WH Choi, M Lueker-boden - US Patent 11,663,471, 2023 - Google Patents
US11663471B2 - Compute-in-memory deep neural network inference engine using low-rank
approximation technique - Google Patents US11663471B2 - Compute-in-memory deep neural …

Channel-parallel compression with random memory access

I Ovsiannikov - US Patent 11,716,095, 2023 - Google Patents
A data compressor a zero-value remover, a zero bit mask generator, a non-zero values
packer, and a row-pointer generator. The zero-value remover receives 2 N bit streams of …