Efficient scheduling policies for Microsecond-Scale tasks

S McClure, A Ousterhout, S Shenker… - … USENIX Symposium on …, 2022 - usenix.org
Datacenter operators today strive to support microsecond-latency applications while also
using their limited CPU resources as efficiently as possible. To achieve this, several recent …

Cornflakes: Zero-copy serialization for microsecond-scale networking

D Raghavan, S Ravi, G Yuan, P Thaker… - Proceedings of the 29th …, 2023 - dl.acm.org
Data serialization is critical for many datacenter applications, but the memory copies
required to move application data into packets are costly. Recent zero-copy APIs expose …

RAMBDA: RDMA-driven Acceleration Framework for Memory-intensive µs-scale Datacenter Applications

Y Yuan, J Huang, Y Sun, T Wang… - … Symposium on High …, 2023 - ieeexplore.ieee.org
Responding to the "datacenter tax" and "killer microseconds" problems for memory-intensive
datacenter applications, diverse solutions including SmartNIC-based ones have been …

Profiling hyperscale big data processing

A Gonzalez, A Kolli, S Khan, S Liu, V Dadu… - Proceedings of the 50th …, 2023 - dl.acm.org
Computing demand continues to grow exponentially, largely driven by "big data" processing
on hyperscale data stores. At the same time, the slowdown in Moore's law is leading the …

Achieving microsecond-scale tail latency efficiently with approximate optimal scheduling

R Iyer, M Unal, M Kogias, G Candea - Proceedings of the 29th …, 2023 - dl.acm.org
Datacenter applications expect microsecond-scale service times and tightly bound tail
latency, with future workloads expected to be even more demanding. To address this …

Parslo: A gradient descent-based approach for near-optimal partial SLO allotment in microservices

A Mirhosseini, S Elnikety, TF Wenisch - … of the ACM Symposium on Cloud …, 2021 - dl.acm.org
Modern cloud services are implemented as graphs of loosely-coupled microservices to
improve programmability, reliability, and scalability. Service Level Objectives (SLOs) define …

RingLeader: Efficiently Offloading Intra-Server Orchestration to NICs

J Lin, A Cardoza, T Khan, Y Ro, BE Stephens… - … USENIX Symposium on …, 2023 - usenix.org
Careful orchestration of requests at a datacenter server is crucial to meet tight tail latency
requirements and ensure high throughput and optimal CPU utilization. Orchestration is multi …

Rebooting virtual memory with Midgard

S Gupta, A Bhattacharyya, Y Oh… - 2021 ACM/IEEE 48th …, 2021 - ieeexplore.ieee.org
Computer systems designers are building cache hierarchies with higher capacity to capture
the ever-increasing working sets of modern workloads. Cache hierarchies with higher …

Turbo: SmartNIC-enabled dynamic load balancing of µs-scale RPCs

H Seyedroudbari, S Vanavasam… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Online services are decomposed into fine-grained software components that communicate
over the network using fine-grained Remote Procedure Calls (RPCs). Inter-server …

Altocumulus: Scalable scheduling for nanosecond-scale remote procedure calls

J Zhao, I Uwizeyimana, K Ganesan… - 2022 55th IEEE/ACM …, 2022 - ieeexplore.ieee.org
Online services in modern datacenters use Remote Procedure Calls (RPCs) to
communicate between different software layers. Despite RPCs using just a few small …