Conditional Video Diffusion Network for Fine-grained Temporal Sentence Grounding

X Fang, Z Xiong, W Fang, X Qu, C Chen, J Dong… - … on Computer Vision, 2025 - Springer

This paper addresses the challenging task of weakly-supervised video temporal grounding.
Existing approaches are generally based on the moment proposal selection framework that …

被引用次数：10 相关文章所有 4 个版本

[PDF] openreview.net

Not all inputs are valid: Towards open-set video moment retrieval using language

X Fang, W Fang, D Liu, X Qu, J Dong, P Zhou… - Proceedings of the …, 2024 - dl.acm.org

Video Moment Retrieval (VMR) targets to retrieve the specific moment corresponding to a
sentence query from an untrimmed video. Although recent respectable works have made …

被引用次数：6 相关文章所有 3 个版本

Rethinking Video Sentence Grounding From a Tracking Perspective With Memory Network and Masked Attention

Z Xiong, D Liu, X Fang, X Qu, J Dong… - IEEE Transactions …, 2024 - ieeexplore.ieee.org

Video sentence grounding (VSG) is the task of identifying the segment of an untrimmed
video that semantically corresponds to a given natural language query. While many existing …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network

X Fang, W Fang, C Wang, D Liu, K Tang… - arXiv preprint arXiv …, 2024 - arxiv.org

Given some video-query pairs with untrimmed videos and sentence queries, temporal
sentence grounding (TSG) aims to locate query-relevant segments in these videos. Although …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Repetitive Action Counting with Hybrid Temporal Relation Modeling

K Li, X Peng, D Guo, X Yang, M Wang - arXiv preprint arXiv:2412.07233, 2024 - arxiv.org

Repetitive Action Counting (RAC) aims to count the number of repetitive actions occurring in
videos. In the real world, repetitive actions have great diversity and bring numerous …

被引用次数：1 相关文章所有 2 个版本

DiffusionVMR: Diffusion Model for Joint Video Moment Retrieval and Highlight Detection

H Zhao, KQ Lin, R Yan, Z Li - IEEE Transactions on Neural …, 2024 - ieeexplore.ieee.org

Video moment retrieval and highlight detection have received attention in the current era of
video content proliferation, aiming to localize moments and estimate clip relevances based …

[PDF] arxiv.org

被引用次数：5 相关文章

高级搜索

QQ 群