Jellyfish: Timely inference serving for dynamic edge networks

V Nigade, P Bauszat, H Bal… - 2022 IEEE Real-Time …, 2022 - ieeexplore.ieee.org
While high accuracy is of paramount importance for deep learning (DL) inference, serving
inference requests on time is equally critical but has not been carefully studied especially …

Interference-aware scheduling for inference serving

D Mendoza, F Romero, Q Li, NJ Yadwadkar… - Proceedings of the 1st …, 2021 - dl.acm.org
Machine learning inference applications have proliferated through diverse domains such as
healthcare, security, and analytics. Recent work has proposed inference serving systems for …

Towards transmission-friendly and robust cnn models over cloud and device

C Ding, Z Lu, F Juefei-Xu, VN Boddeti… - IEEE Transactions on …, 2022 - ieeexplore.ieee.org
Deploying deep convolutional neural network (CNN) models on ubiquitous Internet of
Things (IoT) devices has attracted much attention from industry and academia since it …

Litereconfig: Cost and content aware reconfiguration of video object detection systems for mobile gpus

R Xu, J Lee, P Wang, S Bagchi, Y Li… - Proceedings of the …, 2022 - dl.acm.org
An adaptive video object detection system selects different execution paths at runtime,
based on video content and available resources, so as to maximize accuracy under a target …

Serverless data science-are we there yet? a case study of model serving

Y Wu, TTA Dinh, G Hu, M Zhang, YM Chee… - Proceedings of the 2022 …, 2022 - dl.acm.org
Machine learning (ML) is an important part of modern data science applications. Data
scientists today have to manage the end-to-end ML life cycle that includes both model …

TASTI: semantic indexes for machine learning-based queries over unstructured data

D Kang, J Guibas, PD Bailis, T Hashimoto… - Proceedings of the 2022 …, 2022 - dl.acm.org
Unstructured data (eg, video or text) is now commonly queried by using computationally
expensive deep neural networks or human labelers to produce structured information, eg …

[图书][B] Edge/Fog Computing Paradigm: The Concept, Platforms and Applications.

P Raj, K Saini, C Surianarayanan - 2022 - books.google.com
Page 1 Advances in COMPUTERS Volume 127 Edge/Fog Computing Paradigm: The
Concept, Platforms and Applications Edited by PETHURU RAJ CHELLLAH, KAVITA SAINI …

Adaptive computation offloading for mobile augmented reality

J Ren, L Gao, X Wang, M Ma, G Qiu, H Wang… - Proceedings of the …, 2021 - dl.acm.org
Augmented reality (AR) underpins many emerging mobile applications, but it increasingly
requires more computation power for better machine understanding and user experience …

Neural networks meet physical networks: Distributed inference between edge devices and the cloud

SP Chinchali, E Cidon, E Pergament, T Chu… - Proceedings of the 17th …, 2018 - dl.acm.org
We believe that most future video uploaded over the network will be consumed by machines
for sensing tasks such as automated surveillance and mapping rather than for human …

Edge-coordinated energy-efficient video analytics for digital twin in 6G

P Yang, J Hou, L Yu, W Chen, Y Wu - China Communications, 2023 - ieeexplore.ieee.org
Camera networks are essential to constructing fast and accurate mapping between virtual
and physical space for digital twin. In this paper, with the aim of developing energy-efficient …