Learning to configure converters in hybrid switching data center networks

J Zheng, Z Du, Z Zha, Z Yang, X Gao… - IEEE/ACM Transactions …, 2023 - ieeexplore.ieee.org
Data centers heavily rely on scale-out architectures like fat-tree, BCube and VL2 to
accommodate a large number of commodity servers. Since the traditional electrical network …

Rearchitecting datacenter networks: A new paradigm with optical core and optical edge

S Das, A Silva, TSE Ng - IEEE INFOCOM 2024-IEEE …, 2024 - ieeexplore.ieee.org
All-optical circuit-switching (OCS) technology is the key to design energy-efficient and high-
performance datacenter network (DCN) architectures for the future. However, existing round …

A survey of faults and fault-injection techniques in edge computing systems

M Pourreza, P Narasimhan - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Edge computing has emerged in recent years to reduce latency, conserve bandwidth, and
enhance privacy for applications. As more edge computing applications are being deployed …

Survey of fault management techniques for edge-enabled distributed metaverse applications

S Shaikh, M Jammal - Computer Networks, 2024 - Elsevier
The metaverse, envisioned as a vast, distributed virtual world, relies on edge computing for
low-latency data processing. However, ensuring fault tolerance–the system's ability to …

Graph-Driven Insights for Heterogeneous Data Center Networks: Leveraging Converters With a Novel Graph Transformer Approach

X Ji, C Xu, Z Zhang, L Zhong, K Gao… - IEEE Transactions …, 2024 - ieeexplore.ieee.org
Data centers extensively employ topologies such as CLOS, BCube, and Xpander to manage
their large networks of devices. Data center networks (DCN), which rely on traditional …

ALEPH: Accelerating Distributed Training With eBPF-Based Hierarchical Gradient Aggregation

P Yang, H Xu, G Zhao, Q Zhang, J Liu… - IEEE/ACM Transactions …, 2024 - computer.org
Distributed training includes two important operations: gradient transmission and gradient
aggregation, which will consume massive bandwidth and computing resources. To achieve …

EINS: Edge-Cloud Deep Model Inference with Network-Efficiency Schedule in Serverless

S Peng, Y Lin, W Chen, Y Tang… - 2024 27th International …, 2024 - ieeexplore.ieee.org
Model inference in edge is often regarded as an effective method to alleviate high latency
and enhance data privacy in edge-cloud collaborative computing environment. In this paper …

[PDF][PDF] Accelerating Collective Communications with Mutual Benefits of Optical Rackless DC and In-Network Computing

W Wu, X Chen, Z Ma, X Xie, K Meng, W Wang, Z Zhu - Computing - zuqingzhu.info
We propose a novel architecture to explore the mutual benefits of optical rackless data
center (ORDC) and in-network computing for accelerating collective communications. It …