CAFQA: A classical simulation bootstrap for variational quantum algorithms

GS Ravi, P Gokhale, Y Ding, W Kirby, K Smith… - Proceedings of the 28th …, 2022 - dl.acm.org
Classical computing plays a critical role in the advancement of quantum frontiers in the NISQ
era. In this spirit, this work uses classical simulation to bootstrap Variational Quantum …

Automating reinforcement learning architecture design for code optimization

H Wang, Z Tang, C Zhang, J Zhao, C Cummins… - Proceedings of the 31st …, 2022 - dl.acm.org
Reinforcement learning (RL) is emerging as a powerful technique for solving complex code
optimization tasks with an ample search space. While promising, existing solutions require a …

ytopt: Autotuning scientific applications for energy efficiency at large scales

X Wu, P Balaprakash, M Kruse, J Koo, B Videau… - arXiv preprint arXiv …, 2023 - arxiv.org
As we enter the exascale computing era, efficiently utilizing power and optimizing the
performance of scientific applications under power and energy constraints has become …

Ribbon: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances

B Li, RB Roy, T Patel, V Gadepally, K Gettings… - Proceedings of the …, 2021 - dl.acm.org
Deep learning model inference is a key service in many businesses and scientific discovery
processes. This paper introduces Ribbon, a novel deep learning inference serving system …

Scalable Tuning of (OpenMP) GPU Applications via Kernel Record and Replay

K Parasyris, G Georgakoudis, E Rangel… - Proceedings of the …, 2023 - dl.acm.org
HPC is a heterogeneous world in which host and device code are interleaved throughout
the application. Given the significant performance advantage of accelerators, device code …

Harnessing the crowd for autotuning high-performance computing applications

Y Cho, JW Demmel, J King, XS Li… - 2023 IEEE …, 2023 - ieeexplore.ieee.org
This paper presents GPTuneCrowd, a crowd-based autotuning framework for tuning high-
performance computing applications. GPTuneCrowd collects performance data from various …

Performance optimization using multimodal modeling and heterogeneous gnn

A Dutta, J Alcaraz, A TehraniJamsaz, E Cesar… - Proceedings of the …, 2023 - dl.acm.org
Growing heterogeneity and configurability in HPC architectures has made auto-tuning
applications and runtime parameters on these systems very complex. Users are presented …

Baco: A fast and portable Bayesian compiler optimization framework

EO Hellsten, A Souza, J Lenfers, R Lacouture… - Proceedings of the 28th …, 2023 - dl.acm.org
We introduce the Bayesian Compiler Optimization framework (BaCO), a general purpose
autotuner for modern compilers targeting CPUs, GPUs, and FPGAs. BaCO provides the …

Phronesis: Efficient performance modeling for high-dimensional configuration tuning

Y Li, BC Lee - ACM Transactions on Architecture and Code …, 2022 - dl.acm.org
We present Phronesis, a learning framework for efficiently modeling the performance of data
analytic workloads as a function of their high-dimensional software configuration …

[HTML][HTML] Automated linear solver selection for simulation of multiphysics processes in porous media

Y Zabegaev, E Keilegavlen, E Iversen, I Berre - Computer Methods in …, 2024 - Elsevier
Porous media processes involve various physical phenomena such as mechanical
deformation, transport, and fluid flow. Accurate simulations must capture the strong …