Clite: Efficient and qos-aware co-location of multiple latency-critical jobs for warehouse scale computers

T Patel, D Tiwari - 2020 IEEE International Symposium on High …, 2020 - ieeexplore.ieee.org
Large-scale data centers run latency-critical jobs with quality-of-service (QoS) requirements,
and throughput-oriented background jobs, which need to achieve high perfor-mance …

Unsupervised learning approach for web application auto-decomposition into microservices

M Abdullah, W Iqbal, A Erradi - Journal of Systems and Software, 2019 - Elsevier
Nowadays, large monolithic web applications are manually decomposed into microservices
for many reasons including adopting a modern architecture to ease maintenance and …

Tarema: Adaptive resource allocation for scalable scientific workflows in heterogeneous clusters

J Bader, L Thamsen, S Kulagina, J Will… - … Conference on Big …, 2021 - ieeexplore.ieee.org
Scientific workflow management systems like Nextflow support large-scale data analysis by
abstracting away the details of scientific workflows. In these systems, workflows consist of …

Finding the right cloud configuration for analytics clusters

M Bilal, M Canini, R Rodrigues - … of the 11th ACM Symposium on Cloud …, 2020 - dl.acm.org
Finding good cloud configurations for deploying a single distributed system is already a
challenging task, and it becomes substantially harder when a data analytics cluster is …

Satori: efficient and fair resource partitioning by sacrificing short-term benefits for long-term gains

RB Roy, T Patel, D Tiwari - 2021 ACM/IEEE 48th Annual …, 2021 - ieeexplore.ieee.org
Multi-core architectures have enabled data centers to increasingly co-locate multiple jobs to
improve resource utilization and lower the operational cost. Unfortunately, naively co …

Whence to learn? transferring knowledge in configurable systems using beetle

R Krishna, V Nair, P Jamshidi… - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
As software systems grow in complexity and the space of possible configurations increases
exponentially, finding the near-optimal configuration of a software system becomes …

Ribbon: cost-effective and qos-aware deep learning model inference using a diverse pool of cloud computing instances

B Li, RB Roy, T Patel, V Gadepally, K Gettings… - Proceedings of the …, 2021 - dl.acm.org
Deep learning model inference is a key service in many businesses and scientific discovery
processes. This paper introduces Ribbon, a novel deep learning inference serving system …

With great freedom comes great opportunity: Rethinking resource allocation for serverless functions

M Bilal, M Canini, R Fonseca, R Rodrigues - Proceedings of the …, 2023 - dl.acm.org
Current serverless offerings give users limited flexibility for configuring the resources
allocated to their function invocations. This simplifies the interface for users to deploy server …

Predicting the performance-cost trade-off of applications across multiple systems

A Nassereldine, S Diab, M Baydoun… - 2023 IEEE/ACM …, 2023 - ieeexplore.ieee.org
In modern computing environments, users may have multiple systems accessible to them
such as local clusters, private clouds, or public clouds. This abundance of choices makes it …

Do the best cloud configurations grow on trees? an experimental evaluation of black box algorithms for optimizing cloud workloads

M Bilal, M Serafini, M Canini, R Rodrigues - 2020 - repository.kaust.edu.sa
Cloud configuration optimization is the procedure to determine the number and the type of
instances to use when deploying an application in cloud environments, given a cost or …