The Mozart reuse exposed dataflow processor for AI and beyond: Industrial product

K Sankaralingam, T Nowatzki, V Gangadhar… - Proceedings of the 49th …, 2022 - dl.acm.org
In this paper we introduce the Mozart Processor, which implements a new processing
paradigm called Reuse Exposed Dataflow (RED). RED is a counterpart to existing execution …

Arvon: A Heterogeneous System-in-Package Integrating FPGA and DSP Chiplets for Versatile Workload Acceleration

W Tang, SG Cho, TT Hoang, J Botimer… - IEEE Journal of Solid …, 2023 - ieeexplore.ieee.org
Integrating heterogeneous chiplets in a package presents a promising and cost-effective
approach to constructing scalable and flexible systems for accelerating a wide range of …

Machine Learning Hardware Design for Efficiency, Flexibility, and Scalability [Feature]

JF Zhang, Z Zhang - IEEE Circuits and Systems Magazine, 2023 - ieeexplore.ieee.org
The widespread use of deep neural networks (DNNs) and DNN-based machine learning
(ML) methods justifies DNN computation as a workload class itself. Beginning with a brief …

Methods for multiplying matrices using a plurality of chiplets

JM Bodwin - US Patent 12,001,508, 2024 - Google Patents
A plurality of chiplets may be used to multiply two matrices A and B. Matrix A may be
decomposed into horizontal stripes and matrix B may be decomposed into vertical stripes …
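
The stripe decomposition described in the patent snippet above can be illustrated with a minimal sketch. The snippet is truncated, so how the patent assigns stripes to chiplets is not shown here. The sketch assumes that each pairing of one horizontal stripe of A with one vertical stripe of B yields one independent tile of the product C, which is the kind of work a single chiplet could take on. The names `split_ranges`, `tile_product`, and `striped_matmul` and the parameters `row_parts` and `col_parts` are hypothetical and do not come from the patent.

```python
def split_ranges(n, parts):
    """Split range(n) into `parts` contiguous, near-equal (start, stop) ranges."""
    base, extra = divmod(n, parts)
    ranges, start = [], 0
    for i in range(parts):
        stop = start + base + (1 if i < extra else 0)
        ranges.append((start, stop))
        start = stop
    return ranges


def tile_product(a_stripe, b_stripe):
    """Multiply one horizontal stripe of A by one vertical stripe of B.

    Assumption: this stands in for the work of one chiplet. Here it is
    an ordinary triple loop over plain Python lists.
    """
    inner = len(b_stripe)
    cols = len(b_stripe[0]) if inner else 0
    return [
        [sum(row[k] * b_stripe[k][j] for k in range(inner)) for j in range(cols)]
        for row in a_stripe
    ]


def striped_matmul(a, b, row_parts, col_parts):
    """Compute C = A x B by combining independent stripe-by-stripe tiles."""
    n_rows, n_cols = len(a), len(b[0])
    c = [[0] * n_cols for _ in range(n_rows)]
    for r0, r1 in split_ranges(n_rows, row_parts):
        a_stripe = a[r0:r1]                      # horizontal stripe of A
        for c0, c1 in split_ranges(n_cols, col_parts):
            b_stripe = [row[c0:c1] for row in b]  # vertical stripe of B
            tile = tile_product(a_stripe, b_stripe)
            # Each tile lands in a disjoint block of C, so no reduction
            # across tiles is needed.
            for i, tile_row in enumerate(tile):
                c[r0 + i][c0:c1] = tile_row
    return c
```

For example, `striped_matmul(a, b, 2, 2)` splits the work into four tiles. Because every tile writes a disjoint block of C, the tiles have no data dependencies on one another in this sketch.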