Anghabench: A suite with one million compilable c benchmarks for code-size reduction

AF Da Silva, BC Kind… - 2021 IEEE/ACM …, 2021 - ieeexplore.ieee.org
A predictive compiler uses properties of a program to decide how to optimize it. The compiler
is trained on a collection of programs to derive a model which determines its actions in face …

ExeBench: an ML-scale dataset of executable C functions

J Armengol-Estapé, J Woodruff, A Brauckmann… - Proceedings of the 6th …, 2022 - dl.acm.org
Machine-learning promises to transform compilation and software engineering, yet is
frequently limited by the scope of available datasets. In particular, there is a lack of runnable …

Large language models for compiler optimization

C Cummins, V Seeker, D Grubisic, M Elhoushi… - arXiv preprint arXiv …, 2023 - arxiv.org
We explore the novel application of Large Language Models to code optimization. We
present a 7B-parameter transformer model trained from scratch to optimize LLVM assembly …

Towards enhancing in-context learning for code generation

J Li, Y Zhao, Y Li, G Li, Z Jin - arXiv preprint arXiv:2303.17780, 2023 - arxiv.org
In-context learning (ICL) with pre-trained language models (PTLMs) has shown great
success in code generation. ICL does not require training. PTLMs take as the input a prompt …

Bring your own codegen to deep learning compiler

Z Chen, CH Yu, T Morris, J Tuyls, YH Lai… - arXiv preprint arXiv …, 2021 - arxiv.org
Deep neural networks (DNNs) have been ubiquitously applied in many applications, and
accelerators are emerged as an enabler to support the fast and efficient inference tasks of …

Srtuner: Effective compiler optimization customization by exposing synergistic relations

S Park, S Latifi, Y Park, A Behroozi… - 2022 IEEE/ACM …, 2022 - ieeexplore.ieee.org
Despite ceaseless efforts, extremely large and complex optimization space makes even the
state-of-the-art compilers fail in delivering the most performant setting that can fully utilize the …

Mlgo: a machine learning guided compiler optimizations framework

M Trofin, Y Qian, E Brevdo, Z Lin… - arXiv preprint arXiv …, 2021 - arxiv.org
Leveraging machine-learning (ML) techniques for compiler optimizations has been widely
studied and explored in academia. However, the adoption of ML in general-purpose …

Scope is all you need: Transforming LLMs for HPC Code

T Kadosh, N Hasabnis, VA Vo, N Schneider… - arXiv preprint arXiv …, 2023 - arxiv.org
With easier access to powerful compute resources, there is a growing trend in the field of AI
for software development to develop larger and larger language models (LLMs) to address a …

Octopack: Instruction tuning code large language models

N Muennighoff, Q Liu, A Zebaze, Q Zheng… - arXiv preprint arXiv …, 2023 - arxiv.org
Finetuning large language models (LLMs) on instructions leads to vast performance
improvements on natural language tasks. We apply instruction tuning using code …

Codegen: An open large language model for code with multi-turn program synthesis

E Nijkamp, B Pang, H Hayashi, L Tu, H Wang… - arXiv preprint arXiv …, 2022 - arxiv.org
Program synthesis strives to generate a computer program as a solution to a given problem
specification, expressed with input-output examples or natural language descriptions. The …