Simulation intelligence: Towards a new generation of scientific methods

A Lavin, D Krakauer, H Zenil, J Gottschlich… - arXiv preprint arXiv …, 2021 - arxiv.org
The original" Seven Motifs" set forth a roadmap of essential methods for the field of scientific
computing, where a motif is an algorithmic method that captures a pattern of computation …

AI and ML accelerator survey and trends

A Reuther, P Michaleas, M Jones… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
This paper updates the survey of AI accelerators and processors from past three years. This
paper collects and summarizes the current commercial accelerators that have been publicly …

AKG: automatic kernel generation for neural processing units using polyhedral transformations

J Zhao, B Li, W Nie, Z Geng, R Zhang, X Gao… - Proceedings of the …, 2021 - dl.acm.org
Existing tensor compilers have proven their effectiveness in deploying deep neural networks
on general-purpose hardware like CPU and GPU, but optimizing for neural processing units …

System technology co-optimization for advanced integration

S Pal, A Mallik, P Gupta - Nature Reviews Electrical Engineering, 2024 - nature.com
Advanced integration and packaging will drive the scaling of computing systems in the next
decade. Diversity in performance, cost and scale of the emerging systems implies that …

[HTML][HTML] Tutorial on memristor-based computing for smart edge applications

A Gebregiorgis, A Singh, A Yousefzadeh… - … , Devices, Circuits and …, 2023 - Elsevier
Smart computing on edge-devices has demonstrated huge potential for various application
sectors such as personalized healthcare and smart robotics. These devices aim at bringing …

Vegeta: Vertically-integrated extensions for sparse/dense gemm tile acceleration on cpus

G Jeong, S Damani, AR Bambhaniya… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Deep Learning (DL) acceleration support in CPUs has recently gained a lot of traction, with
several companies (Arm, Intel, IBM) announcing products with specialized matrix engines …

The Central Engine of GRB170817A and the Energy Budget Issue: Kerr Black Hole versus Neutron Star in a Multi-Messenger Analysis

MHPM van Putten - Universe, 2023 - mdpi.com
Upcoming LIGO–Virgo–KAGRA (LVK) observational runs offer new opportunities to probe
the central engines of extreme transient events. Cosmological gamma-ray bursts (GRBs) …

Scalable distributed high-order stencil computations

M Jacquelin, M Araya–Polo… - … Conference for High …, 2022 - ieeexplore.ieee.org
Stencil computations lie at the heart of many scientific and industrial applications. Stencil
algorithms pose several challenges on machines with cache based memory hierarchy, due …

StencilFlow: Mapping large stencil programs to distributed spatial computing systems

J de Fine Licht, A Kuster, T De Matteis… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
Spatial computing devices have been shown to significantly accelerate stencil computations,
but have so far relied on unrolling the iterative dimension of a single stencil operation to …

Bridging data center AI systems with edge computing for actionable information retrieval

Z Liu, A Ali, P Kenesei, A Miceli… - 2021 3rd Annual …, 2021 - ieeexplore.ieee.org
Extremely high data rates at modern synchrotron and X-ray free-electron laser light source
beamlines motivate the use of machine learning methods for data reduction, feature …