Rethinking Video Sentence Grounding From a Tracking Perspective With Memory Network and Masked...

文章

学术资源搜索

获得 3 条结果（用时0.02秒）

我的图书馆

Rethinking Video Sentence Grounding From a Tracking Perspective With Memory Network and Masked...

在引用文章中搜索

[PDF] openreview.net

Not all inputs are valid: Towards open-set video moment retrieval using language

X Fang, W Fang, D Liu, X Qu, J Dong, P Zhou… - Proceedings of the …, 2024 - dl.acm.org

Video Moment Retrieval (VMR) targets to retrieve the specific moment corresponding to a
sentence query from an untrimmed video. Although recent respectable works have made …

被引用次数：6 相关文章所有 3 个版本

[PDF] arxiv.org

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network

X Fang, W Fang, C Wang, D Liu, K Tang… - arXiv preprint arXiv …, 2024 - arxiv.org

Given some video-query pairs with untrimmed videos and sentence queries, temporal
sentence grounding (TSG) aims to locate query-relevant segments in these videos. Although …

被引用次数：1 相关文章所有 2 个版本

[PDF] openreview.net

Temporal Sentence Grounding with Relevance Feedback in Videos

J Dong, X Peng, D Liu, X Qu, X Yang, C Bao… - The Thirty-eighth Annual … - openreview.net

As a widely explored multi-modal task, Temporal Sentence Grounding in videos (TSG)
endeavors to retrieve a specific video segment matched with a given query text from a video …

高级搜索

QQ 群

Rethinking Video Sentence Grounding From a Tracking Perspective With Memory Network and Masked...

Not all inputs are valid: Towards open-set video moment retrieval using language

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network

Temporal Sentence Grounding with Relevance Feedback in Videos

引用