Neural Network Processor with On-Chip Convolution Kernel Storage

J Li, J Zhang - US Patent App. 15/887,367, 2019 - Google Patents
US20190244080A1 - Neural Network Processor with On-Chip Convolution Kernel Storage -
Google Patents US20190244080A1 - Neural Network Processor with On-Chip Convolution …

System and method for parallelizing convolutional neural networks

A Krizhevsky, I Sutskever, GE Hinton - US Patent 9,563,840, 2017 - Google Patents
A parallel convolutional neural network is provided. The CNN is
implemented by a plurality of convolutional neural networks each on a respective …

Forward propagation of secondary objective for deep learning

JK Baker - US Patent App. 16/619,346, 2021 - Google Patents
Computer systems and methods optimize a secondary objective function in the training of a
multi-layer feed-forward neural network in which the secondary objective is a function of the …

Convolution matrix multiply with callback for deep tiling for deep convolutional neural networks

DHF Dijkman, M Badin - US Patent App. 14/845,243, 2016 - Google Patents
A method of address translation of images and filters to virtual matrices to perform a
convolution by matrix multiplication includes receiving an image and a filter. Each image …

Efficient neural network accelerator dataflows

Y Shao, R Venkatesan, M Wang, D Smith… - US Patent …, 2022 - Google Patents
A distributed deep neural net (DNN) utilizing a distributed, tile-based
architecture includes multiple chips, each with a central processing element, a global …

Deep learning using alternating direction method of multipliers

Q Huo, ZJ Yan, K Chen - US Patent 10,579,922, 2020 - Google Patents
Described herein are techniques for training deep neural networks (DNNs) using an
alternating direction method of multipliers (ADMM) algorithm. The DNNs may be trained to …

Neural hardware accelerator for parallel and distributed tensor computations

DS Modha - US Patent App. 15/967,482, 2019 - Google Patents
Embodiments of the present disclosure relate to a hardware
accelerator for parallel and distributed tensor computations, and more specifically, to neural …

Parallelizing the training of convolutional neural networks

A Krizhevsky - US Patent 10,540,587, 2020 - Google Patents
Methods, systems, and apparatus, including computer programs encoded on computer
storage media, for training a convolutional neural network (CNN). The system includes a …

Approximate synchronization for parallel deep learning

S Gupta, R Nair - US Patent 10,338,931, 2019 - Google Patents
Techniques facilitating synchronization of processing engines for parallel deep learning are
provided. In one example, a first processing component associated with a processor and …

Wavelet representation for accelerated deep learning

S Lie, GR Lauterbach, ME James… - US Patent App. 16 …, 2020 - Google Patents
Techniques in advanced deep learning provide improvements in one or more of accuracy,
performance, and energy efficiency. An array of processing elements performs flow-based …