M Peng, L Liu, Z Li, Y Shi, X Zhou - 2023 IEEE International …, 2023 - ieeexplore.ieee.org
Video question answering challenges models on understanding textual questions with
varying complexity and searching for clues from visual content with different hierarchical …