Authors
Peter Bodík, Ishai Menache, Mosharaf Chowdhury, Pradeepkumar Mani, David A Maltz, Ion Stoica
Publication date
2012/8/13
Journal
ACM SIGCOMM Computer Communication Review
Volume
42
Issue
4
Pages
431-442
Publisher
ACM
Description
Datacenter networks have been designed to tolerate failures of network equipment and provide sufficient bandwidth. In practice, however, failures and maintenance of networking and power equipment often make tens to thousands of servers unavailable, and network congestion can increase service latency. Unfortunately, there exists an inherent tradeoff between achieving high fault tolerance and reducing bandwidth usage in network core; spreading servers across fault domains improves fault tolerance, but requires additional bandwidth, while deploying servers together reduces bandwidth usage, but also decreases fault tolerance. We present a detailed analysis of a large-scale Web application and its communication patterns. Based on that, we propose and evaluate a novel optimization framework that achieves both high fault tolerance and significantly reduces bandwidth usage in the network core by …
Total citations
[Chart: citations per year, 2012 to 2024]
Scholar articles
P Bodík, I Menache, M Chowdhury, P Mani, DA Maltz… - ACM SIGCOMM Computer Communication Review, 2012