Domain-specific architectures: Research problems and promising approaches

A Krishnakumar, U Ogras, R Marculescu… - ACM Transactions on …, 2023 - dl.acm.org
Process technology-driven performance and energy efficiency improvements have slowed
down as we approach physical design limits. General-purpose manycore architectures …

INTIACC: A Programmable Floating-Point Accelerator for Partial Differential Equations

PX Huang, Y Tsividis, M Seok - IEEE Journal of Solid-State …, 2024 - ieeexplore.ieee.org
This article presents a 32-bit floating-point (FP32) programmable accelerator for solving a
wide range of partial differential equations (PDEs) based on numerical integration methods …

[PDF][PDF] Domain-specific architectures (DSAs): Research problems and promising approaches

A Krishnakumar, UY Ogras, R Marculescu… - ACM Trans. Embed …, 2022 - academia.edu
Authors' addresses: Anish Krishnakumar, anish. n. krishnakumar@ wisc. edu; Umit Y. Ogras,
uogras@ wisc. edu, University of Wisconsin-Madison, USA; Radu Marculescu, radum …

Gen-acceleration: Pioneering work for hardware accelerator generation using large language models

DLVD Vungarala - 2023 - digitalcommons.njit.edu
Optimizing computational power is critical in the age of data-intensive applications and
Artificial Intelligence (AI)/Machine Learning (ML). While facing challenging bottlenecks …

An Adaptive Replacement Strategy LWIRR for Shared Last Level Cache L3 in Multi-core Processors

N Sahu, B Dash, PK Pattnaik… - … Conference on Trends in …, 2022 - Springer
Multi-core processors from different processor design companies such as Intel; AMD
introduces shared cache memory architecture to improve the performance and better …

Accelerating Graph Analytics on a Reconfigurable Architecture with a Data-Indirect Prefetcher

Y Yang, J Li, N Talati, S Pal, S Feng… - arXiv preprint arXiv …, 2023 - arxiv.org
The irregular nature of memory accesses of graph workloads makes their performance poor
on modern computing platforms. On manycore reconfigurable architectures (MRAs), in …

An augmented cache replacement policy RTC_RRIP for last level cache L3 in state of art multi-core processors

N Sahu, B Dash, PK Pattnaik - AIP Conference Proceedings, 2023 - pubs.aip.org
Multi-core processor with hierarchical multilevel cache helps to reduce memory access time
and also lower down the speed gap between primary memory and processor. Many …

Exploiting Reconfiguration and Co-Design for Domain-Agnostic Hardware Acceleration

S Kim - 2023 - deepblue.lib.umich.edu
Hardware accelerators have become permanent features in the post-Dennard computing
landscape, displacing conventional processors for a variety of applications. Not only have …

Logical Accelerators on Manycore Processors

P Bedoukian - 2023 - search.proquest.com
Traditional general-purpose processors emphasize finding parallelism using hardware
control logic, but this logic is energy inefficient and will limit future performance …

A Dynamic Selective Replication Mechanism for the Distributed Storage Structure

S Han, Y Liu, H Cai - … Conference on Natural Computation, Fuzzy Systems …, 2022 - Springer
For the characteristics of reconfigurable array processor (RAP) with large amount of data
access and obvious space-time, a dynamic selective replication (DSR) mechanism of …