Y Guo, J Liu, M Li,
X Tang, X Chen, B Zhao - arXiv preprint arXiv …, 2024 - arxiv.org
Video Temporal Grounding (VTG) focuses on accurately identifying event timestamps within
a particular video based on a linguistic query, playing a vital role in downstream tasks such …