A survey of techniques for cache partitioning in multicore processors

S Mittal - ACM Computing Surveys (CSUR), 2017 - dl.acm.org
As the number of on-chip cores and memory demands of applications increase, judicious
management of cache resources has become not merely attractive but imperative. Cache …

Sequoia: Enabling quality-of-service in serverless computing

A Tariq, A Pahl, S Nimmagadda, E Rozner… - Proceedings of the 11th …, 2020 - dl.acm.org
Serverless computing is a rapidly growing paradigm that easily harnesses the power of the
cloud. With serverless computing, developers simply provide an event-driven function to …

PerfIso: Performance isolation for commercial Latency-Sensitive services

C Iorgulescu, R Azimi, Y Kwon, S Elnikety… - 2018 USENIX Annual …, 2018 - usenix.org
Large commercial latency-sensitive services, such as web search, run on dedicated clusters
provisioned for peak load to ensure responsiveness and tolerate data center outages. As a …

Understanding, predicting and scheduling serverless workloads under partial interference

L Zhao, Y Yang, Y Li, X Zhou, K Li - Proceedings of the International …, 2021 - dl.acm.org
Interference among distributed cloud applications can be classified into three types: full,
partial and zero. While prior research merely focused on full interference, the partial …

SmartHarvest: Harvesting idle CPUs safely and efficiently in the cloud

Y Wang, K Arya, M Kogias, M Vanga… - Proceedings of the …, 2021 - dl.acm.org
We can increase the efficiency of public cloud datacenters by harvesting allocated but
temporarily idling CPU cores from customer virtual machines (VMs) to run batch or analytics …

TR-Spark: Transient computing for big data analytics

Y Yan, Y Gao, Y Chen, Z Guo, B Chen… - Proceedings of the …, 2016 - dl.acm.org
Large-scale public cloud providers invest billions of dollars into their cloud infrastructure and
operate hundreds of thousands of servers across the globe. For various reasons, much of …

Canvas: Isolated and adaptive swapping for Multi-Applications on remote memory

C Wang, Y Qiao, H Ma, S Liu, W Chen… - … USENIX Symposium on …, 2023 - usenix.org
Remote memory techniques for datacenter applications have recently gained a great deal of
popularity. Existing remote memory techniques focus on the efficiency of a single application …

SLOC: Service level objectives for next generation cloud computing

S Nastic, A Morichetta, T Pusztai… - IEEE Internet …, 2020 - ieeexplore.ieee.org
Since the emergence of cloud computing, service level objectives (SLOs) and service level
agreements (SLAs) have put themselves forward as one of the key enablers for cloud's on …

Topology-aware GPU scheduling for learning workloads in cloud environments

M Amaral, J Polo, D Carrera, S Seelam… - Proceedings of the …, 2017 - dl.acm.org
Recent advances in hardware, such as systems with multiple GPUs and their availability in
the cloud, are enabling deep learning in various domains including health care …

Iron: Isolating network-based CPU in container environments

J Khalid, E Rozner, W Felter, C Xu… - … USENIX Symposium on …, 2018 - usenix.org
Containers are quickly increasing in popularity as the mechanism to deploy computation in
the cloud. In order to provide consistent and reliable performance, cloud providers must …