- 学术资源搜索

A taxonomy of live migration management in cloud computing

T He, R Buyya - ACM Computing Surveys, 2023 - dl.acm.org

Cloud Data Centers have become the key infrastructure for providing services. Instance
migration across different computing nodes in edge and cloud computing is essential to …

被引用次数：33 相关文章所有 3 个版本

[PDF] acm.org

In-depth analyses of unified virtual memory system for GPU accelerated computing

T Allen, R Ge - Proceedings of the International Conference for High …, 2021 - dl.acm.org

The abstraction of a shared memory space over separate CPU and GPU memory domains
has eased the burden of portability for many HPC codebases. However, users pay for the …

被引用次数：39 相关文章所有 5 个版本

[PDF] ugent.be

SAC: Sharing-aware caching in multi-chip GPUs

S Zhang, M Naderan-Tahan, M Jahre… - Proceedings of the 50th …, 2023 - dl.acm.org

Bandwidth non-uniformity in multi-chip GPUs poses a major design challenge for its last-
level cache (LLC) architecture. Whereas a memory-side LLC caches data from the local …

被引用次数：9 相关文章所有 4 个版本

[PDF] acm.org

IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations

B Li, Y Guo, Y Wang, A Jaleel, J Yang… - Proceedings of the 56th …, 2023 - dl.acm.org

Multi-GPU systems have emerged as a desirable platform to deliver high computing
capabilities and large memory capacity to accommodate large dataset sizes. However …

被引用次数：6 相关文章所有 8 个版本

[PDF] nsf.gov

Locality-centric data and threadblock management for massive GPUs

M Khairy, V Nikiforov, D Nellans… - 2020 53rd Annual IEEE …, 2020 - ieeexplore.ieee.org

Recent work has shown that building GPUs with hundreds of SMs in a single monolithic chip
will not be practical due to slowing growth in transistor density, low chip yields, and …

被引用次数：34 相关文章所有 8 个版本

[PDF] nsf.gov

Improving address translation in multi-gpus via sharing and spilling aware tlb design

B Li, J Yin, Y Zhang, X Tang - MICRO-54: 54th Annual IEEE/ACM …, 2021 - dl.acm.org

In recent years, the ever-growing application complexity and input dataset sizes have driven
the popularity of multi-GPU systems as a desirable computing platform for many application …

被引用次数：21 相关文章所有 7 个版本

[PDF] google.com

Gps: A global publish-subscribe model for multi-gpu memory management

H Muthukrishnan, D Lustig, D Nellans… - MICRO-54: 54th Annual …, 2021 - dl.acm.org

Suboptimal management of memory and bandwidth is one of the primary causes of low
performance on systems comprising multiple GPUs. Existing memory management solutions …

被引用次数：17 相关文章所有 3 个版本

[PDF] researchgate.net

Demystifying gpu uvm cost with deep runtime and workload analysis

T Allen, R Ge - 2021 IEEE International Parallel and Distributed …, 2021 - ieeexplore.ieee.org

With GPUs becoming ubiquitous in HPC systems, NVIDIA's Unified Virtual Memory (UVM) is
being adopted as a measure to simplify porting of complex codes to GPU platforms by …

被引用次数：29 相关文章所有 3 个版本

Snakebyte: A tlb design with adaptive and recursive page merging in gpus

J Lee, JM Lee, Y Oh, WJ Song… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

This paper presents an address translation scheme in GPUs named SnakeByte that can
dynamically manage variable-sized pages and maximize TLB reach by recursively merging …

被引用次数：10 相关文章所有 4 个版本

[PDF] acm.org

Photon: A fine-grained sampled simulation methodology for GPU workloads

C Liu, Y Sun, TE Carlson - Proceedings of the 56th Annual IEEE/ACM …, 2023 - dl.acm.org

GPUs, due to their massively-parallel computing architectures, provide high performance for
data-parallel applications. However, existing GPU simulators are too slow to enable …

被引用次数：6 相关文章所有 6 个版本

高级搜索

QQ 群

A taxonomy of live migration management in cloud computing

In-depth analyses of unified virtual memory system for GPU accelerated computing

SAC: Sharing-aware caching in multi-chip GPUs

IDYLL: Enhancing Page Translation in Multi-GPUs via Light Weight PTE Invalidations

Locality-centric data and threadblock management for massive GPUs

Improving address translation in multi-gpus via sharing and spilling aware tlb design

Gps: A global publish-subscribe model for multi-gpu memory management

Demystifying gpu uvm cost with deep runtime and workload analysis

Snakebyte: A tlb design with adaptive and recursive page merging in gpus

Photon: A fine-grained sampled simulation methodology for GPU workloads

引用