Tem-adapter: Adapting image-text pretraining for video question answer

G Chen, X Liu, G Wang, K Zhang… - Proceedings of the …, 2023 - openaccess.thecvf.com
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …

Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

G Chen, X Liu, G Wang, K Zhang, PHS Torr… - 2023 IEEE/CVF …, 2023 - computer.org
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …

Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

G Chen, X Liu, G Wang, K Zhang, PHS Torr… - arXiv preprint arXiv …, 2023 - arxiv.org
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …

Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

G Chen, X Liu, G Wang, K Zhang, PHS Torr… - arXiv e …, 2023 - ui.adsabs.harvard.edu
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …

Tem-adapter: adapting image-text pretraining for video question answer

G Chen, X Liu, G Wang, K Zhang, PHS Torr, XP Zhang… - 2024 - ora.ox.ac.uk
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …

Tem-adapter: Adapting Image-Text Pretraining for Video Question Answer

G Chen, X Liu, G Wang, K Zhang… - 2023 IEEE/CVF …, 2023 - ieeexplore.ieee.org
Video-language pre-trained models have shown remarkable success in guiding video
question-answering (VideoQA) tasks. However, due to the length of video sequences …