Parallel Computation Network Device

G Elias, L Levi, E Romlet, A Marelli - US Patent App. 16/357,356, 2020 - Google Patents
In one embodiment, a network device includes ports to serve as ingress ports and as egress
ports, streaming aggregation circuitry to analyze received data packets to identify the data …

Neural network compute tile

O Temam, R Narayanaswami, H Khaitan… - US Patent …, 2022 - Google Patents
A computing unit is disclosed, comprising a first memory bank for storing input activations
and a second memory bank for storing parameters used in performing computations. The …

Splitting neural network filters for implementation by neural network inference circuit

J Ko, K Duong, SL Teig - US Patent 11,250,326, 2022 - Google Patents
Some embodiments provide a method for compiling a neural network (NN) program for an
NN inference circuit (NNIC) that includes multiple partial dot product computation cir cuits …

Data representation for dynamic precision in neural network cores

JV Arthur, AS Cassidy, MD Flickner, P Datta… - US Patent App. 16 …, 2020 - Google Patents
Systems for neural network computation are provided. A neural network processor
comprises a plurality of neural cores. The neural network processor has one or more pro …

Storage of input values within core of neural network inference circuit

K Duong, J Ko, SL Teig - US Patent 11,468,145, 2022 - Google Patents
Some embodiments provide a neural network inference circuit (NNIC) for executing a NN
that includes multiple computation nodes at multiple layers. Each of a set of the computation …

High performance computing system

R Graham, L Levi - US Patent 11,625,393, 2023 - Google Patents
Part of the communications includes collective operation such as (by way of non-limiting
example) doing sum of multiple vectors (element-wise add operation) from multiple …

Weight value decoder of neural network inference circuit

K Duong, J Ko, SL Teig - US Patent 11,210,586, 2021 - Google Patents
a typical way for an integrated circuit to compute these weight values is to use multiply-
accumulate (MAC) circuits that repeatedly perform the multiplication of an input value by a …

Collective communication system and methods

R Graham, L Levi, G Bloch, D Marcovitch… - US Patent …, 2021 - Google Patents
A method in which a plurality of process are configured to hold a block of data destined for
other processes, with data repacking circuitry including receiving circuitry configured to …

Network element supporting flexible data reduction operations

O Ben-Moshe, L Levi, I Rabenstein, I Matari… - US Patent …, 2022 - Google Patents
(57) ABSTRACT A network element includes a plurality of ports, multiple computational
modules, configurable forwarding circuitry and a central block. The ports include child ports …

Neural network accelerator with compact instruct set

T Afzal - US Patent 11,520,561, 2022 - Google Patents
Described herein is a neural network accelerator with a set of neural processing units and
an instruction set for execu tion on the neural processing units. The instruction set is a …