Frameexit: Conditional early exiting for efficient video recognition

Y Wang, C Yang, S Lan, L Zhu… - … Surveys & Tutorials, 2024 - ieeexplore.ieee.org

The booming development of deep learning applications and services heavily relies on
large deep learning models and massive data in the cloud. However, cloud-based deep …

被引用次数：25 相关文章

[PDF] thecvf.com

Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives

K Grauman, A Westbury, L Torresani… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract We present Ego-Exo4D a diverse large-scale multimodal multiview video dataset
and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric …

被引用次数：109 相关文章所有 5 个版本

[PDF] arxiv.org

Dynamic neural networks: A survey

Y Han, G Huang, S Song, L Yang… - IEEE Transactions on …, 2021 - ieeexplore.ieee.org

Dynamic neural network is an emerging research topic in deep learning. Compared to static
models which have fixed computational graphs and parameters at the inference stage …

被引用次数：751 相关文章所有 7 个版本

[PDF] thecvf.com

Ferv39k: A large-scale multi-scene dataset for facial expression recognition in videos

Y Wang, Y Sun, Y Huang, Z Liu… - Proceedings of the …, 2022 - openaccess.thecvf.com

Current benchmarks for facial expression recognition (FER) mainly focus on static images,
while there are limited datasets for FER in videos. It is still ambiguous to evaluate whether …

被引用次数：95 相关文章所有 7 个版本

[PDF] arxiv.org

Longvlm: Efficient long video understanding via large language models

Y Weng, M Han, H He, X Chang, B Zhuang - European Conference on …, 2025 - Springer

Abstract Empowered by Large Language Models (LLMs), recent advancements in Video-
based LLMs (VideoLLMs) have driven progress in various video understanding tasks. These …

被引用次数：16 相关文章所有 3 个版本

Adapting Neural Networks at Runtime: Current Trends in At-Runtime Optimizations for Deep Learning

M Sponner, B Waschneck, A Kumar - ACM Computing Surveys, 2024 - dl.acm.org

Adaptive optimization methods for deep learning adjust the inference task to the current
circumstances at runtime to improve the resource footprint while maintaining the model's …

被引用次数：7 相关文章

[PDF] thecvf.com

Deecap: Dynamic early exiting for efficient image captioning

Z Fei, X Yan, S Wang, Q Tian - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Both accuracy and efficiency are crucial for image captioning in real-world scenarios.
Although Transformer-based models have gained significant improved captioning …

被引用次数：45 相关文章所有 4 个版本

[PDF] thecvf.com

Learning semantic associations for mirror detection

H Guan, J Lin, RWH Lau - … of the IEEE/CVF Conference on …, 2022 - openaccess.thecvf.com

Mirrors generally lack a consistent visual appearance, making mirror detection very
challenging. Although recent works that are based on exploiting contextual contrasts and …

被引用次数：42 相关文章所有 4 个版本

[PDF] arxiv.org

Adafocus v2: End-to-end training of spatial dynamic networks for video recognition

Y Wang, Y Yue, Y Lin, H Jiang, Z Lai… - 2022 IEEE/CVF …, 2022 - ieeexplore.ieee.org

Recent works have shown that the computational efficiency of video recognition can be
significantly improved by reducing the spatial redundancy. As a representative work, the …

被引用次数：58 相关文章所有 6 个版本

[PDF] thecvf.com

The dark side of dynamic routing neural networks: Towards efficiency backdoor injection

S Chen, H Chen, M Haque, C Liu… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recent advancements in deploying deep neural networks (DNNs) on resource-constrained
devices have generated interest in input-adaptive dynamic neural networks (DyNNs) …

被引用次数：15 相关文章所有 3 个版本

高级搜索

QQ 群