Bridging python to silicon: The soda toolchain

NB Agostini, S Curzel, JJ Zhang, A Limaye, C Tan… - IEEE Micro, 2022 - ieeexplore.ieee.org
Systems performing scientific computing, data analysis, and machine learning tasks have a
growing demand for application-specific accelerators that can provide high computational …

Data motion acceleration: Chaining cross-domain multi accelerators

ST Wang, H Xu, A Mamandipoor… - … Symposium on High …, 2024 - ieeexplore.ieee.org
There has been an arms race for devising accelerators for deep learning in recent years.
However, real-world applications are not only neural networks but often span across …

Energy-efficient hardware acceleration of shallow machine learning applications

Z Zeng, SS Sapatnekar - 2023 Design, Automation & Test in …, 2023 - ieeexplore.ieee.org
ML accelerators have largely focused on building general platforms for deep neural
networks (DNNs), but less so on shallow machine learning (SML) algorithms. This paper …

An Open-Source ML-Based Full-Stack Optimization Framework for Machine Learning Accelerators

H Esmaeilzadeh, S Ghodrati, A Kahng, JK Kim… - ACM Transactions on …, 2024 - dl.acm.org
Parameterizable machine learning (ML) accelerators are the product of recent
breakthroughs in ML. To fully enable their design space exploration (DSE), we propose a …

Hier-RTLMP: A hierarchical automatic macro placer for large-scale complex IP blocks

AB Kahng, R Varadarajan… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
In a typical RTL to GDSII flow, floorplanning or macro placement is a critical step in
achieving decent quality of results (QoR). Moreover, in today's physical synthesis flows (eg …

End-to-end synthesis of dynamically controlled machine learning accelerators

S Curzel, NB Agostini, VG Castellana… - IEEE Transactions …, 2022 - ieeexplore.ieee.org
Edge systems are required to autonomously make real-time decisions based on large
quantities of input data under strict power, performance, area, and other constraints. Meeting …

Exploring machine learning to hardware implementations for large data rate x-ray instrumentation

MM Rahimifar, Q Wingering… - Machine Learning …, 2023 - iopscience.iop.org
Over the past decade, innovations in radiation and photonic detectors considerably
improved their resolution, pixel density, sensitivity, and sampling rate, which all contribute to …

Physically accurate learning-based performance prediction of hardware-accelerated ml algorithms

H Esmaeilzadeh, S Ghodrati, AB Kahng… - Proceedings of the …, 2022 - dl.acm.org
Parameterizable ML accelerators are the product of recent breakthroughs in machine
learning (ML). To fully enable the design space exploration, we propose a physical-design …

Reusing GEMM hardware for efficient execution of depthwise separable convolution on ASIC-based DNN accelerators

SD Manasi, S Banerjee, A Davare, AA Sorokin… - Proceedings of the 28th …, 2023 - dl.acm.org
Deep learning (DL) accelerators are optimized for standard convolution. However,
lightweight convolutional neural networks (CNNs) use depthwise convolution (DwC) in key …

Performance Analysis of DNN Inference/Training with Convolution and non-Convolution Operations

H Esmaeilzadeh, S Ghodrati, AB Kahng… - arXiv preprint arXiv …, 2023 - arxiv.org
Today's performance analysis frameworks for deep learning accelerators suffer from two
significant limitations. First, although modern convolutional neural network (CNNs) consist of …