Resource management in clouds: Survey and research challenges

B Jennings, R Stadler - Journal of Network and Systems Management, 2015 - Springer
Resource management in a cloud environment is a hard problem, due to: the scale of
modern data centers; the heterogeneity of resource types and their interdependencies; the …

A survey on data center networking (DCN): Infrastructure and operations

W Xia, P Zhao, Y Wen, H Xie - IEEE communications surveys & …, 2016 - ieeexplore.ieee.org
Data centers (DCs), owing to the exponential growth of Internet services, have emerged as
an irreplaceable and crucial infrastructure to power this ever-growing trend. A DC typically …

RDMA over commodity ethernet at scale

C Guo, H Wu, Z Deng, G Soni, J Ye, J Padhye… - Proceedings of the …, 2016 - dl.acm.org
Over the past one and a half years, we have been using RDMA over commodity Ethernet
(RoCEv2) to support some of Microsoft's highly-reliable, latency-sensitive services. This …

Re-architecting datacenter networks and stacks for low latency and high performance

M Handley, C Raiciu, A Agache, A Voinescu… - Proceedings of the …, 2017 - dl.acm.org
Modern datacenter networks provide very high capacity via redundant Clos topologies and
low switch latency, but transport protocols rarely deliver matching performance. We present …

CONGA: Distributed congestion-aware load balancing for datacenters

M Alizadeh, T Edsall, S Dharmapurikar… - Proceedings of the …, 2014 - dl.acm.org
We present the design, implementation, and evaluation of CONGA, a network-based
distributed congestion-aware load balancing mechanism for datacenters. CONGA exploits …

Hula: Scalable load balancing using programmable data planes

N Katta, M Hira, C Kim, A Sivaraman… - Proceedings of the …, 2016 - dl.acm.org
Datacenter networks employ multi-rooted topologies (e.g., Leaf-Spine, Fat-Tree) to provide
large bisection bandwidth. These topologies use a large degree of multipathing, and need a …

pFabric: Minimal near-optimal datacenter transport

M Alizadeh, S Yang, M Sharif, S Katti… - ACM SIGCOMM …, 2013 - dl.acm.org
In this paper we present pFabric, a minimalistic datacenter transport design that provides
near theoretically optimal flow completion times even at the 99th percentile for short flows …

Presto: Edge-based load balancing for fast datacenter networks

K He, E Rozner, K Agarwal, W Felter, J Carter… - ACM SIGCOMM …, 2015 - dl.acm.org
Datacenter networks deal with a variety of workloads, ranging from latency-sensitive small
flows to bandwidth-hungry large flows. Load balancing schemes based on flow hashing, e.g., …

How hard can it be? Designing and implementing a deployable multipath TCP

C Raiciu, C Paasch, S Barre, A Ford, M Honda… - 9th USENIX symposium …, 2012 - usenix.org

Reproducible network experiments using container-based emulation

N Handigol, B Heller, V Jeyakumar, B Lantz… - Proceedings of the 8th …, 2012 - dl.acm.org
In an ideal world, all research papers would be runnable: simply click to replicate all results,
using the same setup as the authors. One approach to enable runnable network systems …