Not all inputs are valid: Towards open-set video moment retrieval using language

X Fang, W Fang, D Liu, X Qu, J Dong, P Zhou… - Proceedings of the …, 2024 - dl.acm.org
Video Moment Retrieval (VMR) targets to retrieve the specific moment corresponding to a
sentence query from an untrimmed video. Although recent respectable works have made …

Multi-Pair Temporal Sentence Grounding via Multi-Thread Knowledge Transfer Network

X Fang, W Fang, C Wang, D Liu, K Tang… - arXiv preprint arXiv …, 2024 - arxiv.org
Given some video-query pairs with untrimmed videos and sentence queries, temporal
sentence grounding (TSG) aims to locate query-relevant segments in these videos. Although …

Temporal Sentence Grounding with Relevance Feedback in Videos

J Dong, X Peng, D Liu, X Qu, X Yang, C Bao… - The Thirty-eighth Annual … - openreview.net
As a widely explored multi-modal task, Temporal Sentence Grounding in videos (TSG)
endeavors to retrieve a specific video segment matched with a given query text from a video …