X Fang, D Liu, P Zhou, G Nan - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Given an untrimmed video, temporal sentence grounding (TSG) aims to locate a target moment semantically according to a sentence query. Although previous respectable works …
D Liu, X Fang, P Zhou, X Di, W Lu… - Proceedings of the AAAI …, 2023 - ojs.aaai.org
Given an untrimmed video, temporal sentence localization (TSL) aims to localize a specific segment according to a given sentence query. Though respectable works have made …
X Jiang, Z Zhou, X Xu, Y Yang, G Wang… - Proceedings of the 31st …, 2023 - dl.acm.org
Video Moment Retrieval (VMR) aims at retrieving the most relevant events from an untrimmed video with natural language queries. Existing VMR methods suffer from two …
C Tan, J Lai, WS Zheng, JF Hu - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Video Paragraph Grounding (VPG) is an emerging task in video-language understanding which aims at localizing multiple sentences with semantic relations and …
K Flanagan, D Damen, M Wray - arXiv preprint arXiv:2310.17395, 2023 - arxiv.org
The onset of long-form egocentric datasets such as Ego4D and EPIC-Kitchens presents a new challenge for the task of Temporal Sentence Grounding (TSG). Compared to traditional …
This paper presents a novel hierarchical alignment model (HAM) that learns multi-granularity visual and linguistic representations in an end-to-end manner. We extract key …
S Kim, J Cho, J Yu, YJ Yoo, JY Choi - Proceedings of the AAAI …, 2024 - ojs.aaai.org
In the weakly supervised temporal video grounding study, previous methods use predetermined single Gaussian proposals which lack the ability to express diverse events …
Early weakly supervised video grounding (WSVG) methods often struggle with incomplete boundary detection due to the absence of temporal boundary annotations. To bridge the gap …
Video Moment Retrieval (VMR) is a challenging task at the intersection of vision and language, with the goal to retrieve relevant moments from videos corresponding to …