J Gu, Z Li - Proceedings of the 33rd ACM International Conference …, 2024 - dl.acm.org
Recent studies have found that many VQA models are influenced by biases, preventing
them from effectively using multimodal information for reasoning. Consequently, these …