Supporting address translation for accelerator-centric architectures

B Peccerillo, M Mannino, A Mondelli… - Journal of Systems …, 2022 - Elsevier

In recent years, the limits of the multicore approach emerged in the so-called “dark silicon”
issue and diminishing returns of an ever-increasing core count. Hardware manufacturers …

被引用次数：76 相关文章所有 7 个版本

[PDF] arxiv.org

Gemmini: Enabling systematic deep-learning architecture evaluation via full-stack integration

H Genc, S Kim, A Amid, A Haj-Ali, V Iyer… - 2021 58th ACM/IEEE …, 2021 - ieeexplore.ieee.org

DNN accelerators are often developed and evaluated in isolation without considering the
cross-stack, system-level effects in real-world environments. This makes it difficult to …

被引用次数：239 相关文章所有 7 个版本

Processors, methods, and systems with a configurable spatial accelerator

KE Fleming, KD Glossop, SC Steely Jr, J Tang… - US Patent …, 2020 - Google Patents

2017-08-09 Assigned to INTEL CORPORATION reassignment INTEL CORPORATION
ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors …

被引用次数：149 相关文章所有 4 个版本

[PDF] acm.org

Mosaic: a GPU memory manager with application-transparent support for multiple page sizes

R Ausavarungnirun, J Landgraf, V Miller… - Proceedings of the 50th …, 2017 - dl.acm.org

Contemporary discrete GPUs support rich memory management features such as virtual
memory and demand paging. These features simplify GPU programming by providing a …

被引用次数：154 相关文章所有 26 个版本

[PDF] usenix.org

Telekine: Secure computing with cloud {GPUs}

T Hunt, Z Jia, V Miller, A Szekely, Y Hu… - … USENIX Symposium on …, 2020 - usenix.org

GPUs have become ubiquitous in the cloud due to the dramatic performance gains they
enable in domains such as machine learning and computer vision. However, offloading …

被引用次数：92 相关文章所有 11 个版本

[PDF] cam.ac.uk

Thunderclap: Exploring vulnerabilities in operating system IOMMU protection via DMA from untrustworthy peripherals

AT Markettos, C Rothwell, BF Gutstein, A Pearce… - 2019 - repository.cam.ac.uk

Thunderclap: Exploring Vulnerabilities in Operating System IOMMU Protection via DMA
from Untrustworthy Peripherals Page 1 Thunderclap: Exploring Vulnerabilities in Operating …

被引用次数：105 相关文章所有 12 个版本

[PDF] gatech.edu

Batch-aware unified memory management in GPUs for irregular workloads

H Kim, J Sim, P Gera, R Hadidi, H Kim - Proceedings of the Twenty-Fifth …, 2020 - dl.acm.org

While unified virtual memory and demand paging in modern GPUs provide convenient
abstractions to programmers for working with large-scale applications, they come at a …

被引用次数：77 相关文章所有 3 个版本

[PDF] acm.org

A framework for memory oversubscription management in graphics processing units

C Li, R Ausavarungnirun, CJ Rossbach… - Proceedings of the …, 2019 - dl.acm.org

Modern discrete GPUs support unified memory and demand paging. Automatic
management of data movement between CPU memory and GPU memory dramatically …

被引用次数：94 相关文章所有 10 个版本

[PDF] acm.org

Mask: Redesigning the gpu memory hierarchy to support multi-application concurrency

R Ausavarungnirun, V Miller, J Landgraf… - ACM SIGPLAN …, 2018 - dl.acm.org

Graphics Processing Units (GPUs) exploit large amounts of threadlevel parallelism to
provide high instruction throughput and to efficiently hide long-latency stalls. The resulting …

被引用次数：112 相关文章所有 26 个版本

[PDF] arxiv.org

G10: Enabling an efficient unified gpu memory and storage architecture with smart tensor migrations

H Zhang, Y Zhou, Y Xue, Y Liu, J Huang - … of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

To break the GPU memory wall for scaling deep learning workloads, a variety of architecture
and system techniques have been proposed recently. Their typical approaches include …

被引用次数：11 相关文章所有 9 个版本

高级搜索

QQ 群