A survey on compiler autotuning using machine learning

AH Ashouri, W Killian, J Cavazos, G Palermo… - ACM Computing …, 2018 - dl.acm.org
Since the mid-1990s, researchers have been trying to use machine-learning-based
approaches to solve a number of different compiler optimization problems. These …

End-to-end deep learning of optimization heuristics

C Cummins, P Petoumenos, Z Wang… - 2017 26th …, 2017 - ieeexplore.ieee.org
Accurate automatic optimization heuristics are necessary for dealing with thecomplexity and
diversity of modern hardware and software. Machine learning is aproven technique for …

Asymo: scalable and efficient deep-learning inference on asymmetric mobile cpus

M Wang, S Ding, T Cao, Y Liu, F Xu - Proceedings of the 27th Annual …, 2021 - dl.acm.org
On-device deep learning (DL) inference has attracted vast interest. Mobile CPUs are the
most common hardware for on-device inference and many inference frameworks have been …

Synthesizing benchmarks for predictive modeling

C Cummins, P Petoumenos, Z Wang… - 2017 IEEE/ACM …, 2017 - ieeexplore.ieee.org
Predictive modeling using machine learning is an effective method for building compiler
heuristics, but there is a shortage of benchmarks. Typical machine learning experiments …

Minimizing the cost of iterative compilation with active learning

WF Ogilvie, P Petoumenos, Z Wang… - 2017 IEEE/ACM …, 2017 - ieeexplore.ieee.org
Since performance is not portable between platforms, engineers must fine-tune heuristics for
each processor in turn. This is such a laborious task that high-profile compilers, supporting …

Romou: Rapidly generate high-performance tensor kernels for mobile gpus

R Liang, T Cao, J Wen, M Wang, Y Wang… - Proceedings of the 28th …, 2022 - dl.acm.org
Mobile GPU, as a ubiquitous and powerful accelerator, plays an important role in
accelerating on-device DNN (Deep Neural Network) inference. The frequent-upgrade and …

Machine learning‐based auto‐tuning for enhanced performance portability of OpenCL applications

TL Falch, AC Elster - Concurrency and Computation: Practice …, 2017 - Wiley Online Library
Heterogeneous computing, combining devices with different architectures such as CPUs
and GPUs, is rising in popularity and promises increased performance combined with …

Exploring execution schemes for agent-based traffic simulation on heterogeneous hardware

J Xiao, P Andelfinger, D Eckhoff… - 2018 IEEE/ACM 22nd …, 2018 - ieeexplore.ieee.org
Microscopic traffic simulation is associated with substantial runtimes, limiting the feasibility of
large-scale evaluation of traffic scenarios. Even though today heterogeneous hardware …

Profiling and optimizing deep learning inference on mobile GPUs

S Jiang, L Ran, T Cao, Y Xu, Y Liu - … of the 11th ACM SIGOPS Asia …, 2020 - dl.acm.org
Mobile GPU, as the ubiquitous computing hardware on almost every smartphone, is being
exploited for the deep learning inference. In this paper, we present our measurements on …

Deep learning for compilers

CE Cummins - 2020 - era.ed.ac.uk
Constructing compilers is hard. Optimising compilers are multi-million dollar projects
spanning years of development, yet remain unable to fully exploit the available performance …