A comprehensive review of computer vision in sports: Open issues, future trends and research directions

BT Naik, MF Hashmi, ND Bokde - Applied Sciences, 2022 - mdpi.com
Recent developments in video analysis of sports and computer vision techniques have
achieved significant improvements to enable a variety of critical operations. To provide …

Temporal action segmentation: An analysis of modern techniques

G Ding, F Sener, A Yao - IEEE Transactions on Pattern Analysis …, 2023 - ieeexplore.ieee.org
Temporal action segmentation (TAS) in videos aims at densely identifying video frames in
minutes-long videos with multiple action classes. As a long-range video understanding task …

Socratic models: Composing zero-shot multimodal reasoning with language

A Zeng, M Attarian, B Ichter, K Choromanski… - arXiv preprint arXiv …, 2022 - arxiv.org
Large pretrained (eg," foundation") models exhibit distinct capabilities depending on the
domain of data they are trained on. While these domains are generic, they may only barely …

Timechat: A time-sensitive multimodal large language model for long video understanding

S Ren, L Yao, S Li, X Sun… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
This work proposes TimeChat a time-sensitive multimodal large language model specifically
designed for long video understanding. Our model incorporates two key architectural …

Diffusion action segmentation

D Liu, Q Li, AD Dinh, T Jiang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Temporal action segmentation is crucial for understanding long-form videos. Previous works
on this task commonly adopt an iterative refinement paradigm by using multi-stage models …

Query-dependent video representation for moment retrieval and highlight detection

WJ Moon, S Hyun, SU Park, D Park… - Proceedings of the …, 2023 - openaccess.thecvf.com
Recently, video moment retrieval and highlight detection (MR/HD) are being spotlighted as
the demand for video understanding is drastically increased. The key objective of MR/HD is …

Bridge-prompt: Towards ordinal action understanding in instructional videos

M Li, L Chen, Y Duan, Z Hu, J Feng… - Proceedings of the …, 2022 - openaccess.thecvf.com
Action recognition models have shown a promising capability to classify human actions in
short video clips. In a real scenario, multiple correlated human actions commonly occur in …

Deep multi-scale pyramidal features network for supervised video summarization

H Khan, T Hussain, SU Khan, ZA Khan… - Expert Systems with …, 2024 - Elsevier
Video data are witnessing exponential growth, and extracting summarized information is
challenging. It is always necessary to reduce the load of video traffic for the efficient video …

Combining global and local attention with positional encoding for video summarization

E Apostolidis, G Balaouras, V Mezaris… - … on multimedia (ISM), 2021 - ieeexplore.ieee.org
This paper presents a new method for supervised video summarization. To overcome
drawbacks of existing RNN-based summarization architectures, that relate to the modeling …

Joint video summarization and moment localization by cross-task sample transfer

H Jiang, Y Mu - Proceedings of the IEEE/CVF Conference …, 2022 - openaccess.thecvf.com
Video summarization has recently engaged increasing attention in computer vision
communities. However, the scarcity of annotated data has been a key obstacle in this task …