AI and ML accelerator survey and trends

A Reuther, P Michaleas, M Jones… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
This paper updates the survey of AI accelerators and processors from the past three years. This
paper collects and summarizes the current commercial accelerators that have been publicly …

Allo: A Programming Model for Composable Accelerator Design

H Chen, N Zhang, S Xiang, Z Zeng, M Dai… - Proceedings of the ACM …, 2024 - dl.acm.org
Special-purpose hardware accelerators are increasingly pivotal for sustaining performance
improvements in emerging applications, especially as the benefits of technology scaling …

FlashFFTConv: Efficient convolutions for long sequences with tensor cores

DY Fu, H Kumbong, E Nguyen, C Ré - arXiv preprint arXiv:2311.05908, 2023 - arxiv.org
Convolution models with long filters have demonstrated state-of-the-art reasoning abilities in
many long-sequence tasks but lag behind the most optimized Transformers in wall-clock …

MuchiSim: A simulation framework for design exploration of multi-chip manycore systems

M Orenes-Vera, E Tureci, M Martonosi… - … Analysis of Systems …, 2024 - ieeexplore.ieee.org
The design space exploration of scaled-out manycores for communication-intensive
applications (e.g., graph analytics and sparse linear algebra) is hampered due to either lack …

Utilizing modern computer architectures to solve mathematical optimization problems: A survey

DEB Neira, CD Laird, LR Lueg, SM Harwood… - Computers & Chemical …, 2024 - Elsevier
Numerical algorithms to solve mathematical optimization problems efficiently are essential to
applications in many areas of engineering and computational science. To solve optimization …

Morpher: An open-source integrated compilation and simulation framework for CGRA

D Wijerathne, Z Li, M Karunaratne… - Fifth Workshop on …, 2022 - woset-workshop.github.io
This paper presents Morpher, an open-source end-to-end compilation and simulation
framework, to assist design space exploration and application-level developments of CGRA …

TRAM: An open-source template-based reconfigurable architecture modeling framework

Y Qiu, Y Cao, Y Dai, W Yin… - 2022 32nd International …, 2022 - ieeexplore.ieee.org
Coarse-grained reconfigurable architecture (CGRA) is a promising accelerator design
choice due to its high performance and power efficiency in the computation or data-intensive …

Bridging data center AI systems with edge computing for actionable information retrieval

Z Liu, A Ali, P Kenesei, A Miceli… - 2021 3rd Annual …, 2021 - ieeexplore.ieee.org
Extremely high data rates at modern synchrotron and X-ray free-electron laser light source
beamlines motivate the use of machine learning methods for data reduction, feature …

OpenCGRA: Democratizing coarse-grained reconfigurable arrays

C Tan, NB Agostini, J Zhang, M Minutoli… - 2021 IEEE 32nd …, 2021 - ieeexplore.ieee.org
Reconfigurable architectures are today experiencing a renewed interest for their ability to
provide specialization without sacrificing the capability to adapt to disparate workloads …

Cohort: Software-oriented acceleration for heterogeneous SoCs

T Wei, N Turtayeva, M Orenes-Vera, O Lonkar… - Proceedings of the 28th …, 2023 - dl.acm.org
Philosophically, our approaches to acceleration focus on the extreme. We must optimise
accelerators to the maximum, leaving software to fix any hardware-software mismatches …