Understanding GPU power: A survey of profiling, modeling, and simulation methods

RA Bridges, N Imam, TM Mintz - ACM Computing Surveys (CSUR), 2016 - dl.acm.org
Modern graphics processing units (GPUs) have complex architectures that admit
exceptional performance and energy efficiency for high-throughput applications. Although …

A survey of performance optimization for mobile applications

M Hort, M Kechagia, F Sarro… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
To ensure user satisfaction and success of mobile applications, it is important to provide
highly performant applications. This is particularly important for resource-constrained …

Accel-Sim: An extensible simulation framework for validated GPU modeling

M Khairy, Z Shen, TM Aamodt… - 2020 ACM/IEEE 47th …, 2020 - ieeexplore.ieee.org
In computer architecture, significant innovation frequently comes from industry. However, the
simulation tools used by industry are often not released for open use, and even when they …

Nvbit: A dynamic binary instrumentation framework for nvidia gpus

O Villa, M Stephenson, D Nellans… - Proceedings of the 52nd …, 2019 - dl.acm.org
Binary instrumentation frameworks are widely used to implement profilers, performance
evaluation, error checking, and bug detection tools. While dynamic binary instrumentation …

Nvbitfi: Dynamic fault injection for gpus

T Tsai, SKS Hari, M Sullivan, O Villa… - 2021 51st Annual …, 2021 - ieeexplore.ieee.org
GPUs have found wide acceptance in domains such as high-performance computing and
autonomous vehicles, which require fast processing of large amounts of data along with …

SASSIFI: An architecture-level fault injection tool for GPU application resilience evaluation

SKS Hari, T Tsai, M Stephenson… - … Analysis of Systems …, 2017 - ieeexplore.ieee.org
As GPUs become more pervasive in both scalable high-performance computing systems
and safety-critical embedded systems, evaluating and analyzing their resilience to soft errors …

Optimizing software-directed instruction replication for gpu error detection

A Mahmoud, SKS Hari, MB Sullivan… - … Conference for High …, 2018 - ieeexplore.ieee.org
Application execution on safety-critical and high-performance computer systems must be
resilient to transient errors. As GPUs become more pervasive in such systems, they must …

Ten quick tips for bioinformatics analyses using an Apache Spark distributed computing environment

D Chicco, U Ferraro Petrillo… - PLOS Computational …, 2023 - journals.plos.org
Some scientific studies involve huge amounts of bioinformatics data that cannot be analyzed
on personal computers usually employed by researchers for day-to-day activities but rather …

Hybrid, scalable, trace-driven performance modeling of GPGPUs

Y Arafa, AH Badawy, A ElWazir, A Barai… - Proceedings of the …, 2021 - dl.acm.org
In this paper, we present PPT-GPU, a scalable performance prediction toolkit for GPUs. PPT-
GPU achieves scalability through a hybrid high-level modeling approach where some …

A simple model for portable and fast prediction of execution time and power consumption of GPU kernels

L Braun, S Nikas, C Song, V Heuveline… - ACM Transactions on …, 2020 - dl.acm.org
Characterizing compute kernel execution behavior on GPUs for efficient task scheduling is a
non-trivial task. We address this with a simple model enabling portable and fast predictions …