Authors
Virajith Jalaparti, Peter Bodik, Srikanth Kandula, Ishai Menache, Mikhail Rybalkin, Chenyu Yan
Publication date
2013
Conference
SIGCOMM
Publisher
ACM
Description
We found that interactive services at Bing have highly variable datacenter-side processing latencies because their processing consists of many sequential stages, parallelization across 10s-1000s of servers and aggregation of responses across the network. To improve the tail latency of such services, we use a few building blocks: reissuing laggards elsewhere in the cluster, new policies to return incomplete results and speeding up laggards by giving them more resources. Combining these building blocks to reduce the overall latency is non-trivial because for the same amount of resource (e.g., number of reissues), different stages improve their latency by different amounts. We present Kwiken, a framework that takes an end-to-end view of latency improvements and costs. It decomposes the problem of minimizing latency over a general processing DAG into a manageable optimization over individual stages. Through …
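One of the building blocks named in the description, reissuing laggards, can be illustrated with a small sketch. This is not Kwiken's algorithm and does not attempt the per-stage budget allocation the description mentions. It only shows, under assumed latencies, how launching a second copy of a slow request after a timeout trims the tail. The function names, the timeout, and the sample latencies are all hypothetical.

```python
def latency_with_reissue(primary, backup, timeout):
    """Completion time when a second copy is launched at `timeout`
    if the first copy has not finished by then (assumed policy)."""
    if primary <= timeout:
        return primary  # finished before the reissue would fire
    # Both copies race; the reissued copy starts at `timeout`.
    return min(primary, timeout + backup)


def percentile(values, q):
    """Nearest-rank percentile for an integer q in 1..100."""
    ordered = sorted(values)
    rank = (q * len(ordered) + 99) // 100  # integer ceiling of q*n/100
    return ordered[rank - 1]


def reissue_fraction(primaries, timeout):
    """Share of requests that trigger a reissue: the resource cost."""
    return sum(p > timeout for p in primaries) / len(primaries)


# Hypothetical latencies (ms) for one stage; two requests are laggards.
primaries = [10, 12, 11, 13, 10, 12, 11, 200, 12, 150]
backups = [11] * len(primaries)  # assumed latency of each reissued copy
timeout = 20

with_reissue = [
    latency_with_reissue(p, b, timeout) for p, b in zip(primaries, backups)
]

print("p90 without reissue:", percentile(primaries, 90))
print("p90 with reissue:   ", percentile(with_reissue, 90))
print("fraction reissued:  ", reissue_fraction(primaries, timeout))
```

Raising the timeout lowers the fraction of requests reissued but leaves more of the tail untouched. Trading that cost against the latency gain separately for each stage is the kind of decision the description attributes to the framework.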
Total citations
[Bar chart: citations per year, 2013-2024]
Scholar articles
V Jalaparti, P Bodik, S Kandula, I Menache, M Rybalkin… - ACM SIGCOMM Computer Communication Review, 2013