Long memory latency and limited throughput become performance bottlenecks of GPGPU applications. The latency takes hundreds of cycles, which is difficult to hide by simply …
M Khairy, AG Wassal, M Zahran - Journal of Parallel and Distributed …, 2019 - Elsevier
With the skyrocketing advances of process technology, the increased need to process huge amount of data, and the pivotal need for power efficiency, the usage of Graphics Processing …
Graphics processing units (GPUs) include a large amount of hardware resources for parallel thread executions. However, the resources are not fully utilized during runtime, and …
K Knobe, V Natarajan - Third Symposium on the Frontiers of …, 1990 - computer.org
This paper presents a pre-execution approach for improving GPU performance, called P-mode (pre-execution mode). GPUs utilize a number of concurrent threads for hiding …
MK Yoon, Y Oh, SH Kim, S Lee, D Kim… - IEEE Transactions on …, 2017 - ieeexplore.ieee.org
This paper conducts a detailed study of the factors affecting the operation stalls in terms of the fetch group size on the warp scheduler of GPUs. Throughout this paper, we reveal that …
Massively parallel processing devices, like Graphics Processing Units (GPUs), have the ability to accelerate highly parallel workloads in an energy-efficient manner. However …
GB Kim, JM Kim, CH Kim - Journal of The Korea Society of …, 2019 - koreascience.kr
LRR (Loose Round Robin) warp scheduling policy for GPU architecture results in high warp-level parallelism and balanced loads across multiple warps. However, traditional …
MAR Abdeen - 2018 IEEE 16th Intl Conf on Dependable …, 2018 - ieeexplore.ieee.org
In this paper we present a distributed architecture to address the problems of managing, tracking, and predicting astray person in large crowds. Our case study considers the …
AE Gruber - US Patent 11,640,647, 2023 - Google Patents
US11640647B2 - Methods and apparatus for intra-wave texture looping - Google Patents …