过去一年中添加的文章,按日期排序

WASP: Exploiting GPU Pipeline Parallelism with Hardware-Accelerated Automatic Warp Specialization

NC Crago, S Damani, K Sankaralingam… - … Architecture (HPCA), 2024 - ieeexplore.ieee.org
131 天前 - … Finally, we design and implement a compiler that can … All warps in processing blocks
share a physical register … a program into sub-contexts for execution on parallel processors …

Retargeting and Respecializing GPU Workloads for Performance Portability

IR Ivanov, O Zinenko, J Domke, T Endo… - … Symposium on Code …, 2024 - ieeexplore.ieee.org
133 天前 - … thread does, and the amount of memory and register … of the generated code against
the baseline GPU compiler (… regardless of the compiler thanks to the shared frontand back-…