Scaled compute fabric for accelerated deep learning

GR Lauterbach, S Lie, M Morrison, ME James… - US Patent …, 2022 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, energy efficiency, and cost. In a first embodiment, a scaled array of processing …

Processor element redundancy for accelerated deep learning

S Lie, ME James, M Morrison, S Arekapudi… - US Patent …, 2022 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of cost,
accuracy, performance, and energy efficiency. The deep learning accelerator is …

Systems and methods for high-throughput computations in a deep neural network

R Ratnayake - US Patent 11,341,400, 2022 - Google Patents
This disclosure describes methods and systems for high-throughput computations in a fully-
connected deep neural network. Specifically, a hardware-based deep neural network …

Neuron smearing for accelerated deep learning

S Lie, M Morrison, S Arekapudi, ME James… - US Patent …, 2022 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of processing elements performs flow-based …

Wavelet representation for accelerated deep learning

S Lie, GR Lauterbach, ME James… - US Patent App. 16 …, 2020 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of processing elements performs flow-based …

Task synchronization for accelerated deep learning

S Lie, M Morrison, S Arekapudi, ME James… - US Patent …, 2021 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of processing elements performs flow-based …

Floating-point unit stochastic rounding for accelerated deep learning

S Lie, ME James, M Morrison, GR Lauterbach… - US Patent …, 2022 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of processing elements comprising a portion of …

Hybrid data-model parallelism for efficient deep learning

S Venkataramani, V Srinivasan… - US Patent …, 2023 - Google Patents
The embodiments herein describe hybrid parallelism techniques where a mix of data and
model parallelism techniques are used to split the workload of a layer across an array of …

Microthreading for accelerated deep learning

S Lie, M Morrison, ME James, GR Lauterbach… - US Patent …, 2022 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of compute elements and routers performs flow …

Arithmetic unit for deep learning acceleration

SP Singh, G Desoli, T Boesch - US Patent 11,586,907, 2023 - Google Patents
(57) ABSTRACT Embodiments of a device include an integrated circuit, a reconfigurable
stream switch formed in the integrated circuit, and an arithmetic unit coupled to the …