S Feng, X He, KY Chen, L Ke, X Zhang… - Proceedings of the 49th …, 2022 - dl.acm.org
Near-memory processing has been extensively studied to optimize memory intensive workloads. However, none of the proposed designs address sparse matrix transposition, an …
Dynamic adaptation is a post-silicon optimization technique that adapts the hardware to workload phases. However, current adaptive approaches are oblivious to implicit phases …
The real-world use cases of Machine Learning (ML) have exploded over the past few years. However, the current computing infrastructure is insufficient to support all real-world …
Sparse matrix-vector multiplication (SpMV) is a critical building block for iterative graph analytics algorithms. Typically, such algorithms have a varying active vertex set across …
S Kim, M Fayazi, A Daftardar, KY Chen… - IEEE Journal of Solid …, 2022 - ieeexplore.ieee.org
We present Versa, an energy-efficient 36-core systolic multiprocessor with dynamically reconfigurable interconnects and memory. Versa leverages reconfigurable functional units …
J Li, J Zhou, Y Xiong, X Chen… - 2022 IEEE Workshop …, 2022 - ieeexplore.ieee.org
Sampling is an essential part of raw point cloud data processing, such as in the popular PointNet++ scheme. Farthest Point Sampling (FPS), which iteratively samples the farthest …
M Wijtvliet, A Kumar, H Corporaal - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Demand for coarse grain reconfigurable architectures (CGRAs) has significantly increased in recent years as architectures need to be both energy efficient and flexible. However, most …
Hardware acceleration of Artificial Intelligence (AI) workloads has gained widespread popularity with its potential to deliver unprecedented performance and efficiency. An …
Coarse-grained reconfigurable arrays (CGRAs) are domain-specific devices promising both the flexibility of FPGAs and the performance of ASICs. However, with restricted domains …