[PDF][PDF] Deepweave: Accelerating job completion time with deep reinforcement learning-based coflow scheduling

P Sun, Z Guo, J Wang, J Li, J Lan, Y Hu - Proceedings of the Twenty-Ninth …, 2021 - ijcai.org
To improve the processing efficiency of jobs in distributed computing, the concept of coflow
is proposed. A coflow is a collection of flows that are semantically correlated in a multi-stage …

Flexible cyclic queuing and forwarding for time-sensitive software-defined networks

Y Huang, S Wang, X Zhang, T Huang… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Time-Sensitive Networking (TSN) is emerging to support critical real-time applications in
Industry 4.0. Recent proposals leverage Cyclic Queuing and Forwarding (CQF) to achieve …

Online routing and scheduling for time-sensitive networks

Y Huang, S Wang, T Huang, B Wu… - 2021 IEEE 41st …, 2021 - ieeexplore.ieee.org
Recent proposals leverage Time-Aware Shaper (TAS) to achieve precise transmission in
Time-Sensitive Networking (TSN). However, most of the proposals require the information of …

Glint: Decentralized federated graph learning with traffic throttling and flow scheduling

T Liu, P Li, Y Gu - … IEEE/ACM 29th International Symposium on …, 2021 - ieeexplore.ieee.org
Federated learning has been proposed as a promising distributed machine learning
paradigm with strong privacy protection on training data. Existing work mainly focuses on …

Co-Scheduler: A coflow-aware data-parallel job scheduler in hybrid electrical/optical datacenter networks

Z Li, H Shen - IEEE/ACM Transactions on Networking, 2022 - ieeexplore.ieee.org
To support higher demand for datacenter networks, networking researchers have proposed
hybrid electrical/optical datacenter networks (Hybrid-DCN) that leverages optical circuit …

A distributed framework for task offloading in edge computing networks of arbitrary topology

B Liu, Y Cao, Y Zhang, T Jiang - IEEE Transactions on Wireless …, 2020 - ieeexplore.ieee.org
An important issue in an edge computing (EC) network is to increase the utilities of the end
users concurrently accessing the computation resources. In this paper, we consider the task …

Coflow scheduling in data centers: routing and bandwidth allocation

L Shi, Y Liu, J Zhang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
In distributed computing frameworks like MapReduce, Spark, and Dyrad, a coflow is a set of
flows transferring data between two stages of a job. The job cannot start its next stage unless …

Information cofreshness-aware grant assignment and transmission scheduling for internet of things

YH Chiang, H Lin, Y Ji - IEEE Internet of Things Journal, 2021 - ieeexplore.ieee.org
The proliferation of Internet of Things (IoT) applications has prompted the continuous
increase of research efforts in recent years. In light of the diversified use cases and service …

Optimizing task placement and online scheduling for distributed GNN training acceleration

Z Luo, Y Bao, C Wu - IEEE INFOCOM 2022-IEEE Conference …, 2022 - ieeexplore.ieee.org
Training Graph Neural Networks (GNN) on large graphs is resource-intensive and time-
consuming, mainly due to the large graph data that cannot be fit into the memory of a single …

Online scheduling of coflows by attention-empowered scalable deep reinforcement learning

X Wang, H Shen - Future Generation Computer Systems, 2023 - Elsevier
With the abstraction of parallel data transmission flows being a coflow, data transmissions in
large-scale computing jobs can be modeled by a coflow directed acyclic graph (coflow DAG) …