A survey of machine learning for computer architecture and systems

N Wu, Y Xie - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

Two sides of the same coin: Heterophily and oversmoothing in graph convolutional neural networks

Y Yan, M Hashemi, K Swersky, Y Yang… - … Conference on Data …, 2022 - ieeexplore.ieee.org
In node classification tasks, graph convolutional neural networks (GCNs) have
demonstrated competitive performance over traditional methods on diverse graph data …

Programl: A graph-based program representation for data flow analysis and compiler optimizations

C Cummins, ZV Fisches, T Ben-Nun… - International …, 2021 - proceedings.mlr.press
Abstract Machine learning (ML) is increasingly seen as a viable approach for building
compiler optimization heuristics, but many ML methods cannot replicate even the simplest of …

Pythia: A customizable hardware prefetching framework using online reinforcement learning

R Bera, K Kanellopoulos, A Nori, T Shahroodi… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Past research has proposed numerous hardware prefetching techniques, most of which rely
on exploiting one specific type of program context information (eg, program counter …

Trex: Learning execution semantics from micro-traces for binary similarity

K Pei, Z Xuan, J Yang, S Jana, B Ray - arXiv preprint arXiv:2012.08680, 2020 - arxiv.org
Detecting semantically similar functions--a crucial analysis capability with broad real-world
security usages including vulnerability detection, malware lineage, and forensics--requires …

An imitation learning approach for cache replacement

E Liu, M Hashemi, K Swersky… - International …, 2020 - proceedings.mlr.press
Program execution speed critically depends on increasing cache hits, as cache hits are
orders of magnitude faster than misses. To increase cache hits, we focus on the problem of …

An evaluation of edge tpu accelerators for convolutional neural networks

K Seshadri, B Akin, J Laudon… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Edge TPUs are a domain of accelerators for low-power, edge devices and are widely used
in various Google products such as Coral and Pixel devices. In this paper, we first discuss …

Learning to execute programs with instruction pointer attention graph neural networks

D Bieber, C Sutton, H Larochelle… - Advances in Neural …, 2020 - proceedings.neurips.cc
Graph neural networks (GNNs) have emerged as a powerful tool for learning software
engineering tasks including code completion, bug finding, and program repair. They benefit …

GRANITE: A graph neural network model for basic block throughput estimation

O Sýkora, PM Phothilimthana, C Mendis… - 2022 IEEE …, 2022 - ieeexplore.ieee.org
Analytical hardware performance models yield swift estimation of desired hardware
performance metrics. However, developing these analytical models for modern processors …

Neural execution engines: Learning to execute subroutines

Y Yan, K Swersky, D Koutra… - Advances in …, 2020 - proceedings.neurips.cc
A significant effort has been made to train neural networks that replicate algorithmic
reasoning, but they often fail to learn the abstract concepts underlying these algorithms. This …