Yggdrasil: Reducing Network I/O Tax with (CXL-Based) Distributed Shared Memory

W Tang, Y Han, T Ai, G Li, B Yu, X Yang - Proceedings of the 53rd …, 2024 - dl.acm.org
In communication-intensive applications that run on hosts with high-speed network
hardware, a common challenge arises from the significant burden placed on the native …

Itoyori: Reconciling Global Address Space and Global Fork-Join Task Parallelism

S Shiina, K Taura - Proceedings of the International Conference for High …, 2023 - dl.acm.org
This paper introduces Itoyori, a task-parallel runtime system designed to tackle the
challenge of scaling task parallelism (more specifically, nested fork-join parallelism) beyond …

SNIC-DSM: SmartNIC based DSM Infrastructure for Heterogeneous-ISA Machines

H Ramesh - 2023 - vtechworks.lib.vt.edu
Heterogeneous computing is increasingly used in today's datacenters to meet the increasing
computational demands of applications. Heterogeneous hardware typically includes CPUs …

CRIU-RTX: Remote Thread eXecution using Checkpoint/Restore in Userspace

MHN Mohamed - 2023 - vtechworks.lib.vt.edu
Scaling up application performance on single high-end machines is increasingly becoming
difficult due to scalability challenges of processor interconnects, cache coherence protocols …

fDSM: An FPGA-Accelerated Distributed Shared Memory for Heterogeneous Instruction-Set-Architecture Hardware

NR VSathish - 2022 - vtechworks.lib.vt.edu
Due to the diminishing relevance of Moore's Law, traditional multi-core systems are
increasingly struggling to meet the computational demands of many emerging workloads …