Video summarization using deep neural networks: A survey

A comprehensive review of computer vision in sports: Open issues, future trends and research directions

BT Naik, MF Hashmi, ND Bokde - Applied Sciences, 2022 - mdpi.com

Recent developments in video analysis of sports and computer vision techniques have
achieved significant improvements to enable a variety of critical operations. To provide …

Cited by 76 - Related articles - All 8 versions

[PDF] arxiv.org

Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org

Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Cited by 33 - Related articles - All 8 versions

[PDF] arxiv.org

Socratic models: Composing zero-shot multimodal reasoning with language

A Zeng, M Attarian, B Ichter, K Choromanski… - arXiv preprint arXiv …, 2022 - arxiv.org

Large pretrained (e.g., "foundation") models exhibit distinct capabilities depending on the
domain of data they are trained on. While these domains are generic, they may only barely …

Cited by 393 - Related articles - All 6 versions

[PDF] thecvf.com

Timechat: A time-sensitive multimodal large language model for long video understanding

S Ren, L Yao, S Li, X Sun… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

This work proposes TimeChat, a time-sensitive multimodal large language model specifically
designed for long video understanding. Our model incorporates two key architectural …

Cited by 27 - Related articles - All 4 versions

[PDF] thecvf.com

Diffusion action segmentation

D Liu, Q Li, AD Dinh, T Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com

Temporal action segmentation is crucial for understanding long-form videos. Previous works
on this task commonly adopt an iterative refinement paradigm by using multi-stage models …

Cited by 39 - Related articles - All 5 versions

[PDF] thecvf.com

Query-dependent video representation for moment retrieval and highlight detection

WJ Moon, S Hyun, SU Park, D Park… - Proceedings of the …, 2023 - openaccess.thecvf.com

Recently, video moment retrieval and highlight detection (MR/HD) are being spotlighted as
the demand for video understanding is drastically increased. The key objective of MR/HD is …

Cited by 42 - Related articles - All 5 versions

[PDF] thecvf.com

Bridge-prompt: Towards ordinal action understanding in instructional videos

M Li, L Chen, Y Duan, Z Hu, J Feng… - Proceedings of the …, 2022 - openaccess.thecvf.com

Action recognition models have shown a promising capability to classify human actions in
short video clips. In a real scenario, multiple correlated human actions commonly occur in …

Cited by 63 - Related articles - All 6 versions

Deep multi-scale pyramidal features network for supervised video summarization

H Khan, T Hussain, SU Khan, ZA Khan… - Expert Systems with …, 2024 - Elsevier

Video data are witnessing exponential growth, and extracting summarized information is
challenging. It is always necessary to reduce the load of video traffic for the efficient video …

Cited by 19 - Related articles - All 2 versions

[PDF] iti.gr

Combining global and local attention with positional encoding for video summarization

E Apostolidis, G Balaouras, V Mezaris… - … on multimedia (ISM), 2021 - ieeexplore.ieee.org

This paper presents a new method for supervised video summarization. To overcome
drawbacks of existing RNN-based summarization architectures, that relate to the modeling …

Cited by 64 - Related articles - All 6 versions

[PDF] thecvf.com

Joint video summarization and moment localization by cross-task sample transfer

H Jiang, Y Mu - Proceedings of the IEEE/CVF Conference …, 2022 - openaccess.thecvf.com

Video summarization has recently engaged increasing attention in computer vision
communities. However, the scarcity of annotated data has been a key obstacle in this task …

Cited by 31 - Related articles - All 3 versions
