End-edge-cloud collaborative computing for deep learning: A comprehensive survey

Y Wang, C Yang, S Lan, L Zhu… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org
The booming development of deep learning applications and services heavily relies on
large deep learning models and massive data in the cloud. However, cloud-based deep …

Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives

K Grauman, A Westbury, L Torresani… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract We present Ego-Exo4D a diverse large-scale multimodal multiview video dataset
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org
Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

Ferv39k: A large-scale multi-scene dataset for facial expression recognition in videos

Y Wang, Y Sun, Y Huang, Z Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com
Current benchmarks for facial expression recognition (FER) mainly focus on static images,
while there are limited datasets for FER in videos. It is still ambiguous to evaluate whether …

Longvlm: Efficient long video understanding via large language models

Y Weng, M Han, H He, X Chang, B Zhuang - European Conference on …, 2025 - Springer
Abstract Empowered by Large Language Models (LLMs), recent advancements in Video-
based LLMs (VideoLLMs) have driven progress in various video understanding tasks. These …

Adapting Neural Networks at Runtime: Current Trends in At-Runtime Optimizations for Deep Learning

M Sponner, B Waschneck, A Kumar - ACM Computing Surveys, 2024 - dl.acm.org
Adaptive optimization methods for deep learning adjust the inference task to the current
circumstances at runtime to improve the resource footprint while maintaining the model's …

Deecap: Dynamic early exiting for efficient image captioning

Z Fei, X Yan, S Wang, Q Tian - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Both accuracy and efficiency are crucial for image captioning in real-world scenarios.
Although Transformer-based models have gained significant improved captioning …

Learning semantic associations for mirror detection

H Guan, J Lin, RWH Lau - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com
Mirrors generally lack a consistent visual appearance, making mirror detection very
challenging. Although recent works that are based on exploiting contextual contrasts and …

Adafocus v2: End-to-end training of spatial dynamic networks for video recognition

Y Wang, Y Yue, Y Lin, H Jiang, Z Lai… - 2022 IEEE/CVF …, 2022 - ieeexplore.ieee.org
Recent works have shown that the computational efficiency of video recognition can be
significantly improved by reducing the spatial redundancy. As a representative work, the …

The dark side of dynamic routing neural networks: Towards efficiency backdoor injection

S Chen, H Chen, M Haque, C Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recent advancements in deploying deep neural networks (DNNs) on resource-constrained
devices have generated interest in input-adaptive dynamic neural networks (DyNNs) …