Rethinking multi-modal alignment in multi-choice videoQA from feature and sample perspectives

文章

学术资源搜索

获得 4 条结果（用时0.02秒）

Rethinking multi-modal alignment in multi-choice videoQA from feature and sample perspectives

Counterfactual visual dialog: Robust commonsense knowledge learning from unbiased training

AA Liu, C Huang, N Xu, H Tian, J Liu… - IEEE Transactions on …, 2023 - ieeexplore.ieee.org

Visual Dialog (VD) requires an agent to answer the current question by engaging in a
conversation with humans referring to an image. Despite the recent progress, it is beneficial …

被引用次数：9 相关文章所有 2 个版本

Selective arguments representation with dual relation-aware network for video situation recognition

W Liu, Q He, C Wang, Y Peng, S Xie - Neural Computing and Applications, 2024 - Springer

Argument visual states are helpful for detecting structured components of events in videos,
and existing methods tend to use object detectors to generate their candidates. However …

Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering

M Peng, X Shao, Y Shi, X Zhou - ACM Transactions on Multimedia …, 2023 - dl.acm.org

Video question answering (VideoQA) is challenging as it requires reasoning about natural
language and multimodal interactive relations. Most existing methods apply attention …

被引用次数：3 相关文章所有 2 个版本

[PDF] arxiv.org

HSDreport: Heart Sound Diagnosis with Echocardiography Reports

Z Zhao, P Wang, L Zhao, Y Yang, Y Zhang… - arXiv preprint arXiv …, 2024 - arxiv.org

Heart sound auscultation holds significant importance in the diagnosis of congenital heart
disease. However, existing methods for Heart Sound Diagnosis (HSD) tasks are …

高级搜索

QQ 群

Rethinking multi-modal alignment in multi-choice videoQA from feature and sample perspectives

Counterfactual visual dialog: Robust commonsense knowledge learning from unbiased training

Selective arguments representation with dual relation-aware network for video situation recognition

Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering

HSDreport: Heart Sound Diagnosis with Echocardiography Reports

引用