Communication-efficient edge AI: Algorithms and systems

Y Shi, K Yang, T Jiang, J Zhang… - … Surveys & Tutorials, 2020 - ieeexplore.ieee.org
Artificial intelligence (AI) has achieved remarkable breakthroughs in a wide range of fields,
ranging from speech processing, image classification to drug discovery. This is driven by the …

A comprehensive review of model compression techniques in machine learning

PV Dantas, W Sabino da Silva Jr, LC Cordeiro… - Applied …, 2024 - Springer
This paper critically examines model compression techniques within the machine learning
(ML) domain, emphasizing their role in enhancing model efficiency for deployment in …

Edge-AI-driven framework with efficient mobile network design for facial expression recognition

Y Wu, L Zhang, Z Gu, H Lu, S Wan - ACM Transactions on Embedded …, 2023 - dl.acm.org
Facial Expression Recognition (FER) in the wild poses significant challenges due to realistic
occlusions, illumination, scale, and head pose variations of the facial images. In this article …

Distredge: Speeding up convolutional neural network inference on distributed edge devices

X Hou, Y Guan, T Han, N Zhang - 2022 IEEE International …, 2022 - ieeexplore.ieee.org
As the number of edge devices with computing resources (eg, embedded GPUs, mobile
phones, and laptops) in-creases, recent studies demonstrate that it can be beneficial to col …

Compacting deep neural networks for Internet of Things: Methods and applications

K Zhang, H Ying, HN Dai, L Li, Y Peng… - IEEE Internet of …, 2021 - ieeexplore.ieee.org
Deep neural networks (DNNs) have shown great success in completing complex tasks.
However, DNNs inevitably bring high computational cost and storage consumption due to …

Low latency deep learning inference model for distributed intelligent IoT edge clusters

S Naveen, MR Kounte, MR Ahmed - IEEE Access, 2021 - ieeexplore.ieee.org
Edge computing is a new paradigm enabling intelligent applications for the Internet of
Things (IoT) using mobile, low-cost IoT devices embedded with data analytics. Due to the …

Distributed deep convolutional neural networks for the internet-of-things

S Disabato, M Roveri, C Alippi - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Severe constraints on memory and computation characterizing the Internet-of-Things (IoT)
units may prevent the execution of Deep Learning (DL)-based solutions, which typically …

Self-aware distributed deep learning framework for heterogeneous IoT edge devices

Y Jin, J Cai, J Xu, Y Huan, Y Yan, B Huang… - Future Generation …, 2021 - Elsevier
Implementing artificial intelligence (AI) in the Internet of Things (IoT) involves a move from
the cloud to the heterogeneous and low-power edge, following an urgent demand for …

Memory optimization at edge for distributed convolution neural network

S Naveen, MR Kounte - Transactions on Emerging …, 2022 - Wiley Online Library
Abstract Internet of Things (IoT) edge intelligence has emerged by optimizing the deep
learning (DL) models deployed on resource‐constraint devices for quick decision‐making …

Communication-aware DNN pruning

T Jian, D Roy, B Salehi, N Soltani… - … -IEEE Conference on …, 2023 - ieeexplore.ieee.org
We propose a Communication-aware Pruning (CaP) algorithm, a novel distributed inference
framework for distributing DNN computations across a physical network. Departing from …