Large language models: a comprehensive survey of its applications, challenges, limitations, and future prospects

MU Hadi, R Qureshi, A Shah, M Irfan, A Zafar… - Authorea …, 2023 - techrxiv.org
Within the vast expanse of computerized language processing, a revolutionary entity known
as Large Language Models (LLMs) has emerged, wielding immense power in its capacity to …

A survey of machine learning for computer architecture and systems

N Wu, Y Xie - ACM Computing Surveys (CSUR), 2022 - dl.acm.org
It has been a long time that computer architecture and systems are optimized for efficient
execution of machine learning (ML) models. Now, it is time to reconsider the relationship …

Twig: Multi-agent task management for colocated latency-critical cloud services

R Nishtala, V Petrucci, P Carpenter… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Many of the important services running on data centres are latency-critical, time-varying, and
demand strict user satisfaction. Stringent tail-latency targets for colocated services and …

Fleet: Online federated learning via staleness awareness and performance prediction

G Damaskinos, R Guerraoui, AM Kermarrec… - ACM Transactions on …, 2022 - dl.acm.org
Federated learning (FL) is very appealing for its privacy benefits: essentially, a global model
is trained with updates computed on mobile devices while keeping the data of users local …

Generalizable and interpretable learning for configuration extrapolation

Y Ding, A Pervaiz, M Carbin, H Hoffmann - … of the 29th ACM joint meeting …, 2021 - dl.acm.org
Modern software applications are increasingly configurable, which puts a burden on users to
tune these configurations for their target hardware and workloads. To help users, machine …

A survey of machine learning applied to computer architecture design

DD Penney, L Chen - arXiv preprint arXiv:1909.12373, 2019 - arxiv.org
Machine learning has enabled significant benefits in diverse fields, but, with a few
exceptions, has had limited impact on computer architecture. Recent work, however, has …

Cohmeleon: Learning-based orchestration of accelerator coherence in heterogeneous SoCs

J Zuckerman, D Giri, J Kwon, P Mantovani… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
One of the most critical aspects of integrating loosely-coupled accelerators in
heterogeneous SoC architectures is orchestrating their interactions with the memory …

Fegan: Scaling distributed gans

R Guerraoui, A Guirguis, AM Kermarrec… - Proceedings of the 21st …, 2020 - dl.acm.org
Existing approaches to distribute Generative Adversarial Networks (GANs) either (i) fail to
scale for they typically put the two components of a GAN (the generator and the …

Core placement optimization for multi-chip many-core neural network systems with reinforcement learning

N Wu, L Deng, G Li, Y Xie - ACM Transactions on Design Automation of …, 2020 - dl.acm.org
Multi-chip many-core neural network systems are capable of providing high parallelism
benefited from decentralized execution, and they can be scaled to very large systems with …

Bayesperf: minimizing performance monitoring errors using bayesian statistics

SS Banerjee, S Jha, Z Kalbarczyk, RK Iyer - Proceedings of the 26th …, 2021 - dl.acm.org
Hardware performance counters (HPCs) that measure low-level architectural and
microarchitectural events provide dynamic contextual information about the state of the …