Thread clustering: sharing-aware scheduling on SMP-CMP-SMT multiprocessors

D Tam, R Azimi, M Stumm - ACM SIGOPS Operating Systems Review, 2007 - dl.acm.org
The major chip manufacturers have all introduced chip multiprocessing (CMP) and
simultaneous multithreading (SMT) technology into their processing units. As a result, even …

Warped-slicer: Efficient intra-SM slicing through dynamic resource partitioning for GPU multiprogramming

Q Xu, H Jeon, K Kim, WW Ro… - ACM SIGARCH Computer …, 2016 - dl.acm.org
As technology scales, GPUs are forecasted to incorporate an ever-increasing amount of
computing resources to support thread-level parallelism. But even with the best effort …

Analysis and approximation of optimal co-scheduling on chip multiprocessors

Y Jiang, X Shen, J Chen, R Tripathi - Proceedings of the 17th …, 2008 - dl.acm.org
Cache sharing among processors is important for Chip Multiprocessors to reduce inter-
thread latency, but also brings cache contention, degrading program performance …

Defuzzification of fuzzy intervals

R Zhao, R Govind - Fuzzy sets and systems, 1991 - Elsevier
This paper deals with defuzzification of fuzzy intervals classified as regular or irregular.
Based on the defuzzification method used, a fuzzy interval is regular if its membership …

Does cache sharing on modern CMP matter to the performance of contemporary multithreaded programs?

EZ Zhang, Y Jiang, X Shen - ACM Sigplan Notices, 2010 - dl.acm.org
Most modern Chip Multiprocessors (CMP) feature shared cache on chip. For multithreaded
applications, the sharing reduces communication latency among co-running threads, but …

Probabilistic job symbiosis modeling for SMT processor scheduling

S Eyerman, L Eeckhout - ACM SIGARCH Computer Architecture News, 2010 - dl.acm.org
Symbiotic job scheduling boosts simultaneous multithreading (SMT) processor performance
by co-scheduling jobs that havecompatible'demands on the processor's shared resources …

Co-run scheduling with power cap on integrated cpu-gpu systems

Q Zhu, B Wu, X Shen, L Shen… - 2017 IEEE International …, 2017 - ieeexplore.ieee.org
This paper presents the first systematic study on co-scheduling independent jobs on
integrated CPU-GPU systems with power caps considered. It reveals the performance …

Machine learning for load balancing in the linux kernel

J Chen, SS Banerjee, ZT Kalbarczyk… - Proceedings of the 11th …, 2020 - dl.acm.org
The OS load balancing algorithm governs the performance gains provided by a
multiprocessor computer system. The Linux's Completely Fair Scheduler (CFS) scheduler …

Resource-aware task scheduling

M Tillenius, E Larsson, RM Badia… - ACM Transactions on …, 2015 - dl.acm.org
Dependency-aware task-based parallel programming models have proven to be successful
for developing efficient application software for multicore-based computer architectures. The …

Combining locality analysis with online proactive job co-scheduling in chip multiprocessors

Y Jiang, K Tian, X Shen - … Conference, HiPEAC 2010, Pisa, Italy, January …, 2010 - Springer
The shared-cache contention on Chip Multiprocessors causes performance degradation to
applications and hurts system fairness. Many previously proposed solutions schedule …