Reexamining Direct Cache Access to Optimize I/O Intensive Applications for Multi-hundred-gigabit Networks

A Farshin, A Roozbeh, GQ Maguire Jr… - 2020 USENIX Annual …, 2020 - usenix.org
Memory access is the major bottleneck in realizing multi-hundred-gigabit networks with
commodity hardware, hence it is essential to make good use of cache memory that is a …

When idling is ideal: Optimizing tail-latency for heavy-tailed datacenter workloads with Perséphone

HM Demoulin, J Fried, I Pedisich, M Kogias… - Proceedings of the …, 2021 - dl.acm.org
This paper introduces Perséphone, a kernel-bypass OS scheduler designed to minimize tail
latency for applications executing at microsecond-scale and exhibiting wide service time …

Syrup: User-defined scheduling across the stack

K Kaffes, JT Humphries, D Mazières… - Proceedings of the ACM …, 2021 - dl.acm.org
Suboptimal scheduling decisions in operating systems, networking stacks, and application
runtimes are often responsible for poor application performance, including higher latency …

AlNiCo: SmartNIC-accelerated contention-aware request scheduling for transaction processing

J Li, Y Lu, Q Wang, J Lin, Z Yang, J Shu - 2022 USENIX Annual …, 2022 - usenix.org
High-performance transaction processing needs to schedule numerous requests from the
network. However, such request scheduling comes with costs of complex information …

PacketMill: toward per-Core 100-Gbps networking

A Farshin, T Barbette, A Roozbeh… - Proceedings of the 26th …, 2021 - dl.acm.org
We present PacketMill, a system for optimizing software packet processing, which (i)
introduces a new model to efficiently manage packet metadata and (ii) employs code …

What you need to know about (smart) network interface cards

GP Katsikas, T Barbette, M Chiesa, D Kostić… - … Conference on Passive …, 2021 - Springer
Network interface cards (NICs) are fundamental components of modern high-speed
networked systems, supporting multi-100 Gbps speeds and increasing programmability …

Transparent Multicore Scaling of Single-Threaded Network Functions

L Yan, Y Pan, D Zhou, G Candea… - Proceedings of the …, 2024 - dl.acm.org
This paper presents NFOS, a programming model, runtime, and profiler for productively
developing software network functions (NFs) that scale on multicore machines. Writing …

Automatic Parallelization of Software Network Functions

F Pereira, FMV Ramos, L Pedrosa - 21st USENIX Symposium on …, 2024 - usenix.org
Software network functions (NFs) trade-off flexibility and ease of deployment for an
increased challenge of performance. The traditional way to increase NF performance is by …

Packet order matters! Improving application performance by deliberately delaying packets

H Ghasemirahni, T Barbette, GP Katsikas… - … USENIX Symposium on …, 2022 - usenix.org
Data centers increasingly deploy commodity servers with high-speed network interfaces to
enable low-latency communication. However, achieving low latency at high data rates …

Retina: analyzing 100GbE traffic on commodity hardware

G Wan, F Gong, T Barbette, Z Durumeric - Proceedings of the ACM …, 2022 - dl.acm.org
As network speeds have increased to over 100 Gbps, operators and researchers have lost
the ability to easily ask complex questions of reassembled and parsed network traffic. In this …