Accel-Sim: An extensible simulation framework for validated GPU modeling

M Khairy, Z Shen, TM Aamodt… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
In computer architecture, significant innovation frequently comes from industry. However, the
simulation tools used by industry are often not released for open use, and even when they …

AccelWattch: A power modeling framework for modern GPUs

V Kandiah, S Peverelle, M Khairy, J Pan… - MICRO-54: 54th Annual …, 2021 - dl.acm.org
Graphics Processing Units (GPUs) are rapidly dominating the accelerator space, as
illustrated by their wide-spread adoption in the data analytics and machine learning markets …

A survey on data analysis on large-Scale wireless networks: online stream processing, trends, and challenges

DSV Medeiros, HN Cunha Neto, MA Lopez… - Journal of Internet …, 2020 - Springer
In this paper we focus on knowledge extraction from large-scale wireless networks through
stream processing. We present the primary methods for sampling, data collection, and …

Energy efficient computing systems: Architectures, abstractions and modeling to techniques and standards

R Muralidhar, R Borovica-Gajic, R Buyya - ACM Computing Surveys …, 2022 - dl.acm.org
Computing systems have undergone a tremendous change in the last few decades with
several inflexion points. While Moore's law guided the semiconductor industry to cram more …

Towards inspecting and eliminating trojan backdoors in deep neural networks

W Guo, L Wang, Y Xu, X Xing, M Du… - 2020 IEEE International …, 2020 - ieeexplore.ieee.org
A trojan backdoor is a hidden pattern typically implanted in a deep neural network (DNN). It
could be activated and thus forces that infected model to behave abnormally when an input …

Llmcompass: Enabling efficient hardware design for large language model inference

H Zhang, A Ning, RB Prabhakar… - 2024 ACM/IEEE 51st …, 2024 - ieeexplore.ieee.org
The past year has witnessed the increasing popularity of Large Language Models (LLMs).
Their unprecedented scale and associated high hardware cost have impeded their broader …

Stonne: Enabling cycle-level microarchitectural simulation for dnn inference accelerators

F Muñoz-Martínez, JL Abellán… - 2021 IEEE …, 2021 - ieeexplore.ieee.org
The design of specialized architectures for accelerating the inference procedure of Deep
Neural Networks (DNNs) is a booming area of research nowadays. While first-generation …

Gme: Gpu-based microarchitectural extensions to accelerate homomorphic encryption

K Shivdikar, Y Bao, R Agrawal, M Shen… - Proceedings of the 56th …, 2023 - dl.acm.org
Fully Homomorphic Encryption (FHE) enables the processing of encrypted data without
decrypting it. FHE has garnered significant attention over the past decade as it supports …

Characterizing and modeling non-volatile memory systems

Z Wang, X Liu, J Yang, T Michailidis… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org
Scalable server-grade non-volatile RAM (NVRAM) DIMMs became commercially available
with the release of Intel's Optane DIMM. Recent studies on Optane DIMM systems unveil …

Grus: Toward unified-memory-efficient high-performance graph processing on gpu

P Wang, J Wang, C Li, J Wang, H Zhu… - ACM Transactions on …, 2021 - dl.acm.org
Today's GPU graph processing frameworks face scalability and efficiency issues as the
graph size exceeds GPU-dedicated memory limit. Although recent GPUs can over-subscribe …