Intelligent queue management of open vSwitch in multi-tenant data center

H Ma, X Luo, D Xu - Future Generation Computer Systems, 2023 - Elsevier
Multi-tenant data centers (MTDCs) host numerous applications with dominant transport layer
protocol TCP, hence the performance of TCP matters a lot. However, it is difficult for the …

[HTML][HTML] ExDe: Design space exploration of scheduler architectures and mechanisms for serverless data-processing

S Talluri, N Herbst, C Abad, T De Matteis… - Future Generation …, 2024 - Elsevier
Serverless computing is increasingly used for data-processing applications in both science
and business domains. At the core of serverless data-processing systems is the scheduler …

Disaggregated Multi-Tower: Topology-aware Modeling Technique for Efficient Large Scale Recommendation

L Luo, B Zhang, M Tsang, Y Ma… - Proceedings of …, 2024 - proceedings.mlsys.org
We study a mismatch between the deep learning recommendation models' flat architecture,
common distributedtraining paradigm and hierarchical data center topology. To address the …

MCCS: A Service-based Approach to Collective Communication for Multi-Tenant Cloud

Y Wu, Y Xu, J Chen, Z Wang, Y Zhang… - Proceedings of the …, 2024 - dl.acm.org
Performance of collective communication is critical for distributed systems. Using libraries to
implement collective communication algorithms is not a good fit for a multi-tenant cloud …

Overlay-based Decentralized Federated Learning in Bandwidth-limited Networks

Y Huang, T Sun, T He - arXiv preprint arXiv:2408.04705, 2024 - arxiv.org
The emerging machine learning paradigm of decentralized federated learning (DFL) has the
promise of greatly boosting the deployment of artificial intelligence (AI) by directly learning …

gPerfIsol: GNN-based Rate-Limits Allocation for Performance Isolation in Multi-tenant Cloud

B Nougnanke, J Loye, JF Baffier… - … 27th Conference on …, 2024 - ieeexplore.ieee.org
Performance Isolation in Multi-Tenant Cloud Data Centers (MTCDCs) consists of a set of
mechanisms to make sure tenants' use of resources does not impact other tenants. In this …

Towards a Manageable Intra-Host Network

X Kong, J Lou, W Bai, NS Kim, D Zhuo - … of the 19th Workshop on Hot …, 2023 - dl.acm.org
Intra-host networks, including heterogeneous devices and interconnect fabrics, have
become increasingly complex and crucial. However, intra-host networks today do not …

SmartTags: bridging applications and network for proactive performance management

A Munir, SH Mortazavi, MM Bahnasy… - Proceedings of the …, 2022 - dl.acm.org
Sudden changes in the applications and events in the network are often related. Many of the
datacenter applications go through sudden state changes (such as a query-response in …