X Liang, R Wang, G Lin, J Feng… - 2024 IEEE International …, 2024 - ieeexplore.ieee.org
Joint video moment retrieval and highlight detection is a video understanding task that
requires the model to construct multimodal interaction between heterogeneous features …