A survey on processing-in-memory techniques: Advances and challenges

K Asifuzzaman, NR Miniskar, AR Young, F Liu… - … , Devices, Circuits and …, 2023 - Elsevier
Processing-in-memory (PIM) techniques have gained much attention from computer
architecture researchers, and significant research effort has been invested in exploring and …

A review of near-memory computing architectures: Opportunities and challenges

G Singh, L Chelini, S Corda, AJ Awan… - 2018 21st Euromicro …, 2018 - ieeexplore.ieee.org
The conventional approach of moving stored data to the CPU for computation has become a
major performance bottleneck for emerging scale-out data-intensive applications due to their …

Near-memory computing: Past, present, and future

G Singh, L Chelini, S Corda, AJ Awan, S Stuijk… - Microprocessors and …, 2019 - Elsevier
The conventional approach of moving data to the CPU for computation has become a
significant performance bottleneck for emerging scale-out data-intensive applications due to …

Micro-architecture independent analytical processor performance and power modeling

S Van den Steen, S De Pestel, M Mechri… - … Analysis of Systems …, 2015 - ieeexplore.ieee.org
Optimizing processors for specific application(s) can substantially improve
energy-efficiency. With the end of Dennard scaling, and the corresponding reduction in …

Adding duty cycle only in connected dominating sets for energy efficient and fast data collection

W Shi, W Liu, T Wang, Z Zeng, G Zhi - IEEE Access, 2019 - ieeexplore.ieee.org
In wireless sensor networks (WSNs), energy efficiency and low delay are two pivotal issues
for data collection. Wireless sensor networks are composed of energy-constrained sensor …

RPPM: Rapid performance prediction of multithreaded applications on multicore hardware

S De Pestel, S Van den Steen, S Akram… - IEEE Computer …, 2018 - ieeexplore.ieee.org
This paper proposes RPPM which, based on a microarchitecture-independent profile of a
multithreaded application, predicts its performance on a previously unseen multicore …

PODTherm-GP: A Physics-Based Data-Driven Approach for Effective Architecture-Level Thermal Simulation of Multi-Core CPUs

L Jiang, A Dowling, MC Cheng… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
A thermal simulation methodology derived from the proper orthogonal decomposition (POD)
and the Galerkin projection (GP), hereafter referred to as PODTherm-GP, is evaluated in …

Mansard roofline model: Reinforcing the accuracy of the roofs

D Marques, A Ilic, L Sousa - … on Modeling and Performance Evaluation of …, 2021 - dl.acm.org
Continuous enhancements and diversity in modern multi-core hardware, such as wider and
deeper core pipelines and memory subsystems, bring to practice a set of hard-to-solve …

gem5-accel: A Pre-RTL Simulation Toolchain for Accelerator Architecture Validation

J Vieira, N Roma, G Falcao… - IEEE Computer …, 2023 - ieeexplore.ieee.org
Attaining the performance and efficiency levels required by modern applications often
requires the use of application-specific accelerators. However, writing synthesizable …

NDPmulator: Enabling Full-System Simulation for Near-Data Accelerators From Caches to DRAM

J Vieira, N Roma, G Falcao, P Tomás - IEEE Access, 2024 - ieeexplore.ieee.org
The accurate simulation and performance assessment of Near-Data Accelerators (NDAccs)
is a complex challenge as it must consider the operation of the entire processing system, the …