{OPTIMUSCLOUD}: Heterogeneous configuration optimization for distributed databases in the cloud

A Mahgoub, AM Medoff, R Kumar, S Mitra… - 2020 USENIX Annual …, 2020 - usenix.org
Achieving cost and performance efficiency for cloud-hosted databases requires exploring a
large configuration space, including the parameters exposed by the database along with the …

Scheduling of time-varying workloads using reinforcement learning

SS Mondal, N Sheoran, S Mitra - … of the AAAI Conference on Artificial …, 2021 - ojs.aaai.org
Resource usage of production workloads running on shared compute clusters often fluctuate
significantly across time. While simultaneous spike in the resource usage between two …

Network contention-aware cluster scheduling with reinforcement learning

J Ryu, J Eo - 2023 IEEE 29th International Conference on …, 2023 - ieeexplore.ieee.org
Network contention can significantly degrade training throughput of deep learning jobs in
GPU clusters. In this paper, we present a new approach to mitigate network contention by …

Mimir: Finding Cost-efficient Storage Configurations in the Public Cloud

H Park, GR Ganger, G Amvrosiadis - Proceedings of the 16th ACM …, 2023 - dl.acm.org
Public cloud providers offer a diverse collection of storage types and configurations with
different costs and performance SLAs. As a consequence, it is difficult to select the most cost …

System and method for training and selecting equivalence class prediction modules for resource usage prediction

N Sheoran, S Mitra - US Patent 11,847,496, 2023 - Google Patents
A digital environment includes multiple computing nodes and a scheduling system that
assigns workloads to computing nodes. The scheduling system includes an equivalence …

Towards Designing a Self-Managed Machine Learning Inference Serving System inPublic Cloud

JR Gunasekaran, P Thinakaran, CS Mishra… - arXiv preprint arXiv …, 2020 - arxiv.org
We are witnessing an increasing trend towardsusing Machine Learning (ML) based
prediction systems, span-ning across different application domains, including …

Multi-resource Low-latency Cluster Scheduling without Execution Time Estimation

H Yabuuchi, T Shinagawa - 2020 20th IEEE/ACM International …, 2020 - ieeexplore.ieee.org
Cluster scheduling based on the prior estimation of job execution time is vulnerable to
inaccurate estimates. To avoid performance degradation due to this misestimation, recent …

An Interference Fair Queueing I/O Scheduler under High Performance Processors

S Yan, Y Jiacheng, Y Chun, T Dong - Beijing Da Xue Xue Bao, 2020 - search.proquest.com
High performance processors and systems require high storage bandwidth and efficient
external I/O processing. Many systems using high performance processors server a mixture …