Exploiting processor heterogeneity in interactive services

J Fried, Z Ruan, A Ousterhout, A Belay - 14th USENIX Symposium on …, 2020 - usenix.org

The conventional wisdom is that CPU resources such as cores, caches, and memory
bandwidth must be partitioned to achieve performance isolation between tasks. Both the …

被引用次数：155 相关文章所有 11 个版本

[PDF] microsoft.com

Few-to-many: Incremental parallelism for reducing tail latency in interactive services

ME Haque, YH Eom, Y He, S Elnikety, R Bianchini… - ACM Sigplan …, 2015 - dl.acm.org

Interactive services, such as Web search, recommendations, games, and finance, must
respond quickly to satisfy customers. Achieving this goal requires optimizing tail (eg, 99th+ …

被引用次数：134 相关文章所有 8 个版本

[PDF] acm.org

Exploiting heterogeneity for tail latency and energy efficiency

ME Haque, Y He, S Elnikety, TD Nguyen… - Proceedings of the 50th …, 2017 - dl.acm.org

Interactive service providers have strict requirements on high-percentile (tail) latency to meet
user expectations. If providers meet tail latency targets with less energy, they increase …

被引用次数：83 相关文章所有 7 个版本

[PDF] mit.edu

CuttleSys: Data-driven resource management for interactive services on reconfigurable multicores

N Kulkarni, G Gonzalez-Pumariega… - 2020 53rd Annual …, 2020 - ieeexplore.ieee.org

Multi-tenancy for latency-critical applications leads to resource interference and
unpredictable performance. Core reconfiguration opens up more opportunities for …

被引用次数：25 相关文章所有 7 个版本

[PDF] psu.edu

Delayed-Dynamic-Selective (DDS) prediction for reducing extreme tail latency in web search

S Kim, Y He, S Hwang, S Elnikety, S Choi - Proceedings of the Eighth …, 2015 - dl.acm.org

A commercial web search engine shards its index among many servers, and therefore the
response time of a search query is dominated by the slowest server that processes the …

被引用次数：70 相关文章所有 7 个版本

[PDF] acm.org

Work stealing for interactive services to meet target latency

J Li, K Agrawal, S Elnikety, Y He, ITA Lee, C Lu… - Proceedings of the 21st …, 2016 - dl.acm.org

Interactive web services increasingly drive critical business workloads such as search,
advertising, games, shopping, and finance. Whereas optimizing parallel programs and …

被引用次数：61 相关文章所有 13 个版本

[PDF] usenix.org

Elfen Scheduling:{Fine-Grain} Principled Borrowing from {Latency-Critical} Workloads Using Simultaneous Multithreading

X Yang, SM Blackburn, KS McKinley - 2016 USENIX Annual Technical …, 2016 - usenix.org

Web services from search to games to stock trading impose strict Service Level Objectives
(SLOs) on tail latency. Meeting these objectives is challenging because the computational …

被引用次数：69 相关文章所有 9 个版本

Q-zilla: A scheduling framework and core microarchitecture for tail-tolerant microservices

A Mirhosseini, BL West, GW Blake… - … Symposium on High …, 2020 - ieeexplore.ieee.org

Managing tail latency is a primary challenge in designing large-scale Internet services.
Queuing is a major contributor to end-to-end tail latency, wherein nominal tasks are …

被引用次数：27 相关文章所有 2 个版本

[PDF] uchicago.edu

CASH: Supporting IaaS customers with a sub-core configurable architecture

Y Zhou, H Hoffmann, D Wentzlaff - ACM SIGARCH Computer …, 2016 - dl.acm.org

Infrastructure as a Service (IaaS) Clouds have grown increasingly important. Recent
architecture designs support IaaS providers through fine-grain configurability, allowing …

被引用次数：49 相关文章所有 15 个版本

[PDF] usenix.org

The {TURBO} diaries: Application-controlled frequency scaling explained

JT Wamhoff, S Diestelhorst, C Fetzer, P Marlier… - 2014 USENIX Annual …, 2014 - usenix.org

Most multi-core architectures nowadays support dynamic voltage and frequency scaling
(DVFS) to adapt their speed to the system's load and save energy. Some recent …

被引用次数：60 相关文章所有 12 个版本

高级搜索

QQ 群