Designing distributed systems using approximate synchrony in data center networks

DRK Ports, J Li, V Liu, NK Sharma… - … USENIX Symposium on …, 2015 - usenix.org
Distributed systems are traditionally designed independently from the underlying network,
making worst-case assumptions (e.g., complete asynchrony) about its behavior. However …

Consensus in a box: Inexpensive coordination in hardware

Z István, D Sidler, G Alonso, M Vukolic - 13th USENIX Symposium on …, 2016 - usenix.org
Consensus mechanisms for ensuring consistency are some of the most expensive
operations in managing large amounts of data. Often, there is a trade-off that involves …

Sundial: Fault-tolerant clock synchronization for datacenters

Y Li, G Kumar, H Hariharan, H Wassel… - … USENIX Symposium on …, 2020 - usenix.org
Clock synchronization is critical for many datacenter applications such as distributed
transactional databases, consistent snapshots, and network telemetry. As applications have …

Aquila: A unified, low-latency fabric for datacenter networks

D Gibson, H Hariharan, E Lance, M McLaren… - … USENIX Symposium on …, 2022 - usenix.org
Datacenter workloads have evolved from the data intensive, loosely-coupled workloads of
the past decade to more tightly coupled ones, wherein ultra-low latency communication is …

Clock-RSM: Low-latency inter-datacenter state machine replication using loosely synchronized physical clocks

J Du, D Sciascia, S Elnikety… - 2014 44th Annual …, 2014 - ieeexplore.ieee.org
This paper proposes Clock-RSM, a new state machine replication protocol that uses loosely
synchronized physical clocks to totally order commands for geo-replicated services. Clock …

NetChain: Scale-Free Sub-RTT coordination

X Jin, X Li, H Zhang, N Foster, J Lee, R Soulé… - … USENIX Symposium on …, 2018 - usenix.org
Coordination services are a fundamental building block of modern cloud systems, providing
critical functionalities like configuration management and distributed locking. The major …

Datacenter RPCs can be general and fast

A Kalia, M Kaminsky, D Andersen - 16th USENIX Symposium on …, 2019 - usenix.org
It is commonly believed that datacenter networking software must sacrifice generality to
attain high performance. The popularity of specialized distributed systems designed …

Scafida: A scale-free network inspired data center architecture

L Gyarmati, TA Trinh - ACM SIGCOMM Computer Communication …, 2010 - dl.acm.org
Data centers have a crucial role in current Internet architecture supporting content-centric
networking. State-of-the-art data centers have different architectures like fat-tree, DCell, or …

HovercRaft: Achieving scalability and fault-tolerance for microsecond-scale datacenter services

M Kogias, E Bugnion - … of the Fifteenth European Conference on …, 2020 - dl.acm.org
Cloud platform services must simultaneously be scalable, meet low tail latency service-level
objectives, and be resilient to a combination of software, hardware, and network failures …

Sharing the data center network

A Shieh, S Kandula, A Greenberg, C Kim… - 8th USENIX Symposium …, 2011 - usenix.org
While today's data centers are multiplexed across many non-cooperating applications, they
lack effective means to share their network. Relying on TCP's congestion control, as we …