View while moving: Efficient video recognition in long-untrimmed videos

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

View while moving: Efficient video recognition in long-untrimmed videos

在引用文章中搜索

[PDF] arxiv.org

Longvlm: Efficient long video understanding via large language models

Y Weng, M Han, H He, X Chang, B Zhuang - arXiv preprint arXiv …, 2024 - arxiv.org

Empowered by Large Language Models (LLMs), recent advancements in VideoLLMs have
driven progress in various video understanding tasks. These models encode video …

被引用次数：4 相关文章所有 3 个版本

Efficiently adapting large pre-trained models for real-time violence recognition in smart city surveillance

X Ren, W Fan, Y Wang - Journal of Real-Time Image Processing, 2024 - Springer

Recently, the concept of smart cities has gained prominence, aiming to enhance urban
efficiency, safety, and quality of life through advanced technologies. A critical component of …

AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting

M Yang, Y Tian, L Zhang, X Liang, X Ran… - arXiv preprint arXiv …, 2024 - arxiv.org

Recently, prompt-based methods have emerged as a new alternativeparameter-efficient fine-
tuning'paradigm, which only fine-tunes a small number of additional parameters while …

高级搜索

QQ 群

View while moving: Efficient video recognition in long-untrimmed videos

Longvlm: Efficient long video understanding via large language models

Efficiently adapting large pre-trained models for real-time violence recognition in smart city surveillance

AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting

引用