F Xu, W Zhou, T Sun, J Lu, Z Yu, G Li - International Conference on …, 2024 - Springer
Abstract: The task of Video-Grounded Dialogue involves developing a multimodal chatbot
capable of answering sequential questions from humans regarding video content, audio …