Open problems in queueing theory inspired by datacenter computing

M Harchol-Balter - Queueing Systems, 2021 - Springer
Datacenter operations today provide a plethora of new queueing and scheduling problems.
The notion of a “job” has become more general and multi-dimensional. The ways in which …

Optimal scheduling in the multiserver-job model under heavy traffic

I Grosof, Z Scully, M Harchol-Balter… - Proceedings of the ACM …, 2022 - dl.acm.org
Multiserver-job systems, where jobs require concurrent service at many servers, occur
widely in practice. Essentially all of the theoretical work on multiserver-job systems focuses …

The Gittins policy is nearly optimal in the M/G/k under extremely general conditions

Z Scully, I Grosof, M Harchol-Balter - … of the ACM on Measurement and …, 2020 - dl.acm.org
The Gittins scheduling policy minimizes the mean response in the single-server M/G/1
queue in a wide variety of settings. Most famously, Gittins is optimal when preemption is …

WCFS: A new framework for analyzing multiserver systems

I Grosof, M Harchol-Balter, A Scheller-Wolf - Queueing Systems, 2022 - Springer
Multiserver queueing systems are found at the core of a wide variety of practical systems.
Many important multiserver models have a previously-unexplained similarity: identical mean …

Online evolutionary batch size orchestration for scheduling deep learning workloads in GPU clusters

Z Bian, S Li, W Wang, Y You - … of the International Conference for High …, 2021 - dl.acm.org
Efficient GPU resource scheduling is essential to maximize resource utilization and save
training costs for the increasing amount of deep learning workloads in shared GPU clusters …

A new toolbox for scheduling theory

Z Scully - ACM SIGMETRICS Performance Evaluation Review, 2023 - dl.acm.org
Queueing delays are ubiquitous in many domains, including computer systems, service
systems, communication networks, supply chains, and transportation. Queueing and …

[HTML][HTML] A reinforcement learning algorithm for scheduling parallel processors with identical speedup functions

F Ziaei, M Ranjbar - Machine Learning with Applications, 2023 - Elsevier
In this study, we investigate a real-time system where computationally intensive tasks are
executed using cloud computing platforms in data centers. These data centers are designed …

Optimal multiserver scheduling with unknown job sizes in heavy traffic

Z Scully, I Grosof, M Harchol-Balter - ACM SIGMETRICS Performance …, 2020 - dl.acm.org
We consider scheduling to minimize mean response time of the M/G/k queue with unknown
job sizes. In the singleserver k= 1 case, the optimal policy is the Gittins policy, but it is not …

Performance of the Gittins policy in the G/G/1 and G/G/k, with and without setup times

Y Hong, Z Scully - ACM SIGMETRICS Performance Evaluation Review, 2023 - dl.acm.org
We consider the classic problem of preemptively scheduling jobs of unknown size (aka
service time) in a queue to minimize mean number-in-system, or equivalently mean …

SRPT scheduling discipline in many-server queues with impatient customers

J Dong, R Ibrahim - Management Science, 2021 - pubsonline.informs.org
The shortest-remaining-processing-time (SRPT) scheduling policy has been extensively
studied, for more than 50 years, in single-server queues with infinitely patient jobs. Yet …