Open problems in queueing theory inspired by datacenter computing

M Harchol-Balter - Queueing Systems, 2021 - Springer
Datacenter operations today provide a plethora of new queueing and scheduling problems.
The notion of a “job” has become more general and multi-dimensional. The ways in which …

Optimal scheduling in the multiserver-job model under heavy traffic

I Grosof, Z Scully, M Harchol-Balter… - Proceedings of the ACM …, 2022 - dl.acm.org
Multiserver-job systems, where jobs require concurrent service at many servers, occur
widely in practice. Essentially all of the theoretical work on multiserver-job systems focuses …

Scalable load balancing in networked systems: A survey of recent advances

MV der Boor, SC Borst, JSH Van Leeuwaarden… - SIAM Review, 2022 - SIAM
In this survey we provide an overview of recent advances on scalable load balancing
schemes which provide favorable delay performance and yet require minimal …

A new toolbox for scheduling theory

Z Scully - ACM SIGMETRICS Performance Evaluation Review, 2023 - dl.acm.org
Queueing delays are ubiquitous in many domains, including computer systems, service
systems, communication networks, supply chains, and transportation. Queueing and …

Heavy-Traffic Optimal Size-and State-Aware Dispatching

R Xie, I Grosof, Z Scully - Proceedings of the ACM on Measurement and …, 2024 - dl.acm.org
Dispatching systems, where arriving jobs are immediately assigned to one of multiple
queues, are ubiquitous in computer systems and service systems. A natural and practically …

Performance of the Gittins policy in the G/G/1 and G/G/k, with and without setup times

Y Hong, Z Scully - ACM SIGMETRICS Performance Evaluation Review, 2023 - dl.acm.org
We consider the classic problem of preemptively scheduling jobs of unknown size (aka
service time) in a queue to minimize mean number-in-system, or equivalently mean …

Optimal Scheduling in Multiserver Queues

I Grosof - ACM SIGMETRICS Performance Evaluation Review, 2024 - dl.acm.org
Scheduling theory is a key tool for reducing latency (ie response time) in queueing systems.
Scheduling, ie choosing the order in which to serve jobs, can reduce response time by an …

Mean field analysis of join-below-threshold load balancing for resource sharing servers

IA Horváth, Z Scully, B Van Houdt - … of the ACM on Measurement and …, 2019 - dl.acm.org
Load balancing plays a crucial role in many large scale computer systems. Much prior work
has focused on systems with First-Come-First-Served (FCFS) servers. However, servers in …

Online fair allocation of perishable resources

S Banerjee, C Hssaine, SR Sinclair - ACM SIGMETRICS Performance …, 2023 - dl.acm.org
We consider a practically motivated variant of the canonical online fair allocation problem: a
decision-maker has a budget of resources to allocate over a fixed number of rounds. Each …

Strongly Tail-Optimal Scheduling in the Light-Tailed M/G/1

G Yu, Z Scully - Proceedings of the ACM on Measurement and Analysis …, 2024 - dl.acm.org
We study the problem of scheduling jobs in a queueing system, specifically an M/G/1 with
light-tailed job sizes, to asymptotically optimize the response time tail. This means …