Pythia: A customizable hardware prefetching framework using online reinforcement learning

R Bera, K Kanellopoulos, A Nori, T Shahroodi… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Past research has proposed numerous hardware prefetching techniques, most of which rely
on exploiting one specific type of program context information (eg, program counter …

Fine-grained address segmentation for attention-based variable-degree prefetching

P Zhang, A Srivastava, AV Nori, R Kannan… - Proceedings of the 19th …, 2022 - dl.acm.org
Machine learning algorithms have shown potential to improve prefetching performance by
accurately predicting future memory accesses. Existing approaches are based on the …

Predicting memory accesses: the road to compact ml-driven prefetcher

A Srivastava, A Lazaris, B Brooks, R Kannan… - Proceedings of the …, 2019 - dl.acm.org
With the advent of fast processors, TPUs, accelerators, and heterogeneous architectures,
computation is no longer the only bottleneck. In fact for many applications, speed of …

Raop: Recurrent neural network augmented offset prefetcher

P Zhang, A Srivastava, B Brooks, R Kannan… - Proceedings of the …, 2020 - dl.acm.org
The rapid development of Big Data coupled with slowing down of Moore's law has made the
memory performance a bottleneck in the von Neumann architecture. Machine learning has …

Sharp: Software hint-assisted memory access prediction for graph analytics

P Zhang, R Kannan, X Tong, AV Nori… - 2022 IEEE High …, 2022 - ieeexplore.ieee.org
Memory system performance is a major bottleneck in large-scale graph analytics. Data
prefetching can hide memory latency; this relies on accurate prediction of memory accesses …

Phases, Modalities, Spatial and Temporal Locality: Domain Specific ML Prefetcher for Accelerating Graph Analytics

P Zhang, R Kannan, VK Prasanna - Proceedings of the International …, 2023 - dl.acm.org
Memory performance is a key bottleneck in accelerating graph analytics. Existing Machine
Learning (ML) prefetchers encounter challenges with phase transitions and irregular …

Memmap: Compact and generalizable meta-lstm models for memory access prediction

A Srivastava, TY Wang, P Zhang, CAF De Rose… - Advances in Knowledge …, 2020 - Springer
With the rise of Big Data, there has been a significant effort in increasing compute power
through GPUs, TPUs, and heterogeneous architectures. As a result, many applications are …

Transformap: Transformer for memory access prediction

P Zhang, A Srivastava, AV Nori, R Kannan… - arXiv preprint arXiv …, 2022 - arxiv.org
Data Prefetching is a technique that can hide memory latency by fetching data before it is
needed by a program. Prefetching relies on accurate memory access prediction, to which …

MetaSys: A practical open-source metadata management system to implement and evaluate cross-layer optimizations

N Vijaykumar, A Olgun, K Kanellopoulos… - ACM Transactions on …, 2022 - dl.acm.org
This article introduces the first open-source FPGA-based infrastructure, MetaSys, with a
prototype in a RISC-V system, to enable the rapid implementation and evaluation of a wide …

Memory virtualization for accessing heterogeneous memory components

A Ray, PR Maharana, G Anand - US Patent 11,416,395, 2022 - Google Patents
A computing system having at least one bus, a plurality of different memory components,
and a processing device operatively coupled with the plurality of memory components …