Content-aware Input Scaling and Deep Learning Computation Offloading for Low-Latency Embedded Vision

O Prabhune, T Chen, Y Kim - Proceedings of the IEEE/CVF …, 2024 - openaccess.thecvf.com
Deploying deep learning (DL) models for visual recognition on embedded systems is often
constrained by their limited compute power and storage capacity and has stringent latency …

Split Computing With Scalable Feature Compression for Visual Analytics on the Edge

Z Yuan, S Rawlekar, S Garg, E Erkip… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Running deep visual analytics models for real-time applications is challenging for mobile
devices. Offloading the computation to edge server can mitigate computation bottleneck at …

Deep contextualized compressive offloading for images

B Chen, Z Yan, H Guo, Z Yang, A Ali-Eldin… - Proceedings of the 19th …, 2021 - dl.acm.org
Recent years have witnessed sensors becoming an indispensable part of our life with the
camera being one of the most popular and widely deployed sensors. The camera gives rise …

Adaptive Compression Offloading and Resource Allocation for Edge Vision Computing

W Xiao, Y Hao, J Liang, L Hu… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
The rapid progress in edge computing (EC) and 5G wireless communication technology has
opened up novel opportunities for intelligent applications driven by Deep Neural Networks …

Deepeye: Resource efficient local execution of multiple deep vision models using wearable commodity hardware

A Mathur, ND Lane, S Bhattacharya, A Boran… - Proceedings of the 15th …, 2017 - dl.acm.org
Wearable devices with built-in cameras present interesting opportunities for users to capture
various aspects of their daily life and are potentially also useful in supporting users with low …

Supervised compression for resource-constrained edge computing systems

Y Matsubara, R Yang, M Levorato… - Proceedings of the …, 2022 - openaccess.thecvf.com
There has been much interest in deploying deep learning algorithms on low-powered
devices, including smartphones, drones, and medical sensors. However, full-scale deep …

Improving the accuracy-latency trade-off of edge-cloud computation offloading for deep learning services

X Zhao, M Hosseinzadeh, N Hudson… - 2020 IEEE …, 2020 - ieeexplore.ieee.org
Offloading tasks to the edge or the Cloud has the potential to improve accuracy of
classification and detection tasks as more powerful hardware and machine learning models …

Head network distillation: Splitting distilled deep neural networks for resource-constrained edge computing systems

Y Matsubara, D Callegaro, S Baidya, M Levorato… - IEEE …, 2020 - ieeexplore.ieee.org
As the complexity of Deep Neural Network (DNN) models increases, their deployment on
mobile devices becomes increasingly challenging, especially in complex vision tasks such …

Offloaded execution of deep learning inference at edge: Challenges and insights

S Dey, J Mondal, A Mukherjee - 2019 IEEE International …, 2019 - ieeexplore.ieee.org
Efforts to leverage the benefits of Deep Learning (DL) models for performing inference in
resource constrained embedded devices is very popular nowadays. Researchers worldwide …

Resource and Bandwidth-Aware Video Analytics with Adaptive Offloading

L Zhang, Y Zhong, J Liu, L Cui - … on Mobile Ad Hoc and Smart …, 2023 - ieeexplore.ieee.org
To enable computation-intensive video analytics, streaming video data and offloading
computation from the source to the inference server running deep neural networks has now …