Momentum cross-modal contrastive learning for video moment retrieval

D Han, X Cheng, N Guo, X Ye… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org
Video moment retrieval aims to locate the timestamps best matching the query description
within an untrimmed video. However, existing video moment retrieval approaches typically …

Revisiting Unsupervised Temporal Action Localization: The Primacy of High-Quality Actionness and Pseudolabels

H Jiang, H Tang, M Yan, J Zhang, M Xu, Y Hu… - Proceedings of the …, 2024 - dl.acm.org
Recently, temporal action localization (TAL) methods, especially the weakly-supervised and
unsupervised ones, have become a hot research topic. Existing unsupervised methods …

Positive and Negative Set Designs in Contrastive Feature Learning for Temporal Action Segmentation

YC Chen, WT Chu - IEEE Transactions on Circuits and Systems …, 2024 - ieeexplore.ieee.org
When data labels are scarce, contrastive learning is often used to learn representations in a
weakly-supervised or unsupervised way. In contrastive learning, not only the learning …

Weakly supervised temporal action localization with actionness-guided false positive suppression

Z Li, Z Wang, Q Liu - Neural Networks, 2024 - Elsevier
Weakly supervised temporal action localization aims to locate the temporal boundaries of
action instances in untrimmed videos using video-level labels and assign them the …

Diffusion-based framework for weakly-supervised temporal action localization

Y Zou, Q Zhao, PK Sarker, S Li, L Wang, W Liu - Pattern Recognition, 2025 - Elsevier
Weakly supervised temporal action localization aims to localize action instances with only
video-level supervision. Due to the absence of frame-level annotation supervision, how …

A Snippets Relation and Hard-Snippets Mask Network for Weakly-Supervised Temporal Action Localization

Y Zhao, H Zhang, Z Gao, W Guan… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
Weakly-supervised temporal action localization (WTAL) is a problem learning an action
localization model with only video-level labels available. In recent years, many WTAL …

Snippet-to-Prototype Contrastive Consensus Network for Weakly Supervised Temporal Action Localization

Y Shao, F Zhang, C Xu - IEEE Transactions on Multimedia, 2024 - ieeexplore.ieee.org
Weakly-supervised temporal action localization aims to localize action instances from
untrimmed videos with only video-level labels. Due to the lack of frame-wise annotations …

Weakly-supervised Action Learning in Procedural Task Videos via Process Knowledge Decomposition

M Zou, Q Zeng, X Zhang - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
Action learning is a research area that aims to recognize the action category of each frame
in the video. Context information is crucial for learning actions, but most existing methods …

Text-Video Knowledge Guided Prompting for Weakly Supervised Temporal Action Localization

Y Shao, F Zhang, C Xu - … on Circuits and Systems for Video …, 2024 - ieeexplore.ieee.org
Weakly supervised temporal action localization (WTAL) aims to localize action instances
with only video-level labels for supervision. Recent methods convert category labels to …

Weakly-supervised temporal action localization using multi-branch attention weighting

M Liu, W Li, F Ge, X Gao - Multimedia Systems, 2024 - Springer
Weakly-supervised temporal action localization aims to train an accurate and robust
localization model using only video-level labels. Due to the lack of frame-level temporal …