Tail-Learning: Adaptive Learning Method for Mitigating Tail Latency in Autonomous Edge Systems

C Zhang, Y Deng, H Zhao, T Chen, S Deng - arXiv preprint arXiv …, 2023 - arxiv.org
In the realm of edge computing, the increasing demand for high Quality of Service (QoS),
particularly in dynamic multimedia streaming applications (eg, Augmented Reality/Virtual …

Scheduling Inference Workloads on Distributed Edge Clusters with Reinforcement Learning

G Castellano, JJ Nieto, J Luque, F Diego… - arXiv preprint arXiv …, 2023 - arxiv.org
Many real-time applications (eg, Augmented/Virtual Reality, cognitive assistance) rely on
Deep Neural Networks (DNNs) to process inference tasks. Edge computing is considered a …

Edge orchestration for latency-sensitive applications

A Rahmanian - 2024 - diva-portal.org
The emerging edge computing infrastructure provides distributed and heterogeneous
resources closer to where data is generated and where end-users are located, thereby …

Q-zilla: A scheduling framework and core microarchitecture for tail-tolerant microservices

A Mirhosseini, BL West, GW Blake… - … Symposium on High …, 2020 - ieeexplore.ieee.org
Managing tail latency is a primary challenge in designing large-scale Internet services.
Queuing is a major contributor to end-to-end tail latency, wherein nominal tasks are …

Scheduling latency-sensitive applications in edge computing

V Scoca, A Aral, I Brandic, R De Nicola, RB Uriarte - 2018 - eprints.cs.univie.ac.at
Edge computing is an emerging technology that aims to include latency-sensitive and data-
intensive applications such as mobile or IoT services, into the cloud ecosystem by placing …

Fast-DRD: Fast decentralized reinforcement distillation for deadline-aware edge computing

S Song, Z Fang, J Jiang - Information processing & management, 2022 - Elsevier
Edge computing has recently gained momentum as it provides computing services for
mobile devices through high-speed networks. In edge computing system optimization, deep …

Collate: Collaborative neural network learning for latency-critical edge systems

S Huai, D Liu, H Kong, X Luo, W Liu… - 2022 IEEE 40th …, 2022 - ieeexplore.ieee.org
Federated Learning (FL) empowers multiple clients to collaboratively learn a model,
enlarging the training data of each client for high accuracy while protecting data privacy …

FrameFeedback: A Closed-Loop Control System for Dynamic Offloading Real-Time Edge Inference

M Jackson, B Ji, DS Nikolopoulos - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Despite the demand for realtime deep learning applications such as video analytics at the
edge, resource-constrained edge devices can largely not process video streams at their …

A3C-DO: A regional resource scheduling framework based on deep reinforcement learning in edge scenario

J Zou, T Hao, C Yu, H Jin - IEEE Transactions on Computers, 2020 - ieeexplore.ieee.org
Currently, huge amounts of data are produced by edge device. Considering the heavy
burden of network bandwidth and the service delay requirements of delay-sensitive …

Addressing application latency requirements through edge scheduling

A Aral, I Brandic, RB Uriarte, R De Nicola… - Journal of Grid …, 2019 - Springer
Latency-sensitive and data-intensive applications, such as IoT or mobile services, are
leveraged by Edge computing, which extends the cloud ecosystem with distributed …