Improving resource efficiency at scale with heracles

S Mittal - ACM Computing Surveys (CSUR), 2017 - dl.acm.org

As the number of on-chip cores and memory demands of applications increase, judicious
management of cache resources has become not merely attractive but imperative. Cache …

被引用次数：85 相关文章所有 3 个版本

[PDF] acm.org

Sequoia: Enabling quality-of-service in serverless computing

A Tariq, A Pahl, S Nimmagadda, E Rozner… - Proceedings of the 11th …, 2020 - dl.acm.org

Serverless computing is a rapidly growing paradigm that easily harnesses the power of the
cloud. With serverless computing, developers simply provide an event-driven function to …

被引用次数：118 相关文章所有 4 个版本

[PDF] usenix.org

{PerfIso}: Performance isolation for commercial {Latency-Sensitive} services

C Iorgulescu, R Azimi, Y Kwon, S Elnikety… - 2018 USENIX Annual …, 2018 - usenix.org

Large commercial latency-sensitive services, such as web search, run on dedicated clusters
provisioned for peak load to ensure responsiveness and tolerate data center outages. As a …

被引用次数：147 相关文章所有 14 个版本

[PDF] academia.edu

Understanding, predicting and scheduling serverless workloads under partial interference

L Zhao, Y Yang, Y Li, X Zhou, K Li - Proceedings of the International …, 2021 - dl.acm.org

Interference among distributed cloud applications can be classified into three types: full,
partial and zero. While prior research merely focused on full interference, the partial …

被引用次数：48 相关文章所有 4 个版本

[PDF] semanticscholar.org

Smartharvest: Harvesting idle cpus safely and efficiently in the cloud

Y Wang, K Arya, M Kogias, M Vanga… - Proceedings of the …, 2021 - dl.acm.org

We can increase the efficiency of public cloud datacenters by harvesting allocated but
temporarily idling CPU cores from customer virtual machines (VMs) to run batch or analytics …

被引用次数：69 相关文章所有 3 个版本

[PDF] microsoft.com

Tr-spark: Transient computing for big data analytics

Y Yan, Y Gao, Y Chen, Z Guo, B Chen… - Proceedings of the …, 2016 - dl.acm.org

Large-scale public cloud providers invest billions of dollars into their cloud infrastructure and
operate hundreds of thousands of servers across the globe. For various reasons, much of …

被引用次数：121 相关文章所有 3 个版本

[PDF] usenix.org

Canvas: Isolated and adaptive swapping for {Multi-Applications} on remote memory

C Wang, Y Qiao, H Ma, S Liu, W Chen… - … USENIX Symposium on …, 2023 - usenix.org

Remote memory techniques for datacenter applications have recently gained a great deal of
popularity. Existing remote memory techniques focus on the efficiency of a single application …

被引用次数：32 相关文章所有 13 个版本

[PDF] tuwien.ac.at

Sloc: Service level objectives for next generation cloud computing

S Nastic, A Morichetta, T Pusztai… - IEEE Internet …, 2020 - ieeexplore.ieee.org

Since the emergence of cloud computing service level objectives (SLOs) and service level
agreements (SLAs) have put themselves forward as one of the key enablers for cloud's on …

被引用次数：55 相关文章所有 7 个版本

[PDF] acm.org

Topology-aware gpu scheduling for learning workloads in cloud environments

M Amaral, J Polo, D Carrera, S Seelam… - Proceedings of the …, 2017 - dl.acm.org

Recent advances in hardware, such as systems with multiple GPUs and their availability in
the cloud, are enabling deep learning in various domains including health care …

被引用次数：85 相关文章所有 7 个版本

[PDF] usenix.org

Iron: Isolating network-based {CPU} in container environments

J Khalid, E Rozner, W Felter, C Xu… - … USENIX Symposium on …, 2018 - usenix.org

Containers are quickly increasing in popularity as the mechanism to deploy computation in
the cloud. In order to provide consistent and reliable performance, cloud providers must …

被引用次数：67 相关文章所有 16 个版本

高级搜索

QQ 群