Overcoming data limitation in medical visual question answering

Q Jin, Z Yuan, G Xiong, Q Yu, H Ying, C Tan… - ACM Computing …, 2022 - dl.acm.org

Automatic Question Answering (QA) has been successfully applied in various domains such
as search engines and chatbots. Biomedical QA (BQA), as an emerging QA task, enables …

被引用次数：128 相关文章所有 4 个版本

[PDF] arxiv.org

Medical visual question answering: A survey

Z Lin, D Zhang, Q Tao, D Shi, G Haffari, Q Wu… - Artificial Intelligence in …, 2023 - Elsevier

Abstract Medical Visual Question Answering (VQA) is a combination of medical artificial
intelligence and popular VQA challenges. Given a medical image and a clinically relevant …

被引用次数：107 相关文章所有 8 个版本

[PDF] arxiv.org

Pmc-vqa: Visual instruction tuning for medical visual question answering

X Zhang, C Wu, Z Zhao, W Lin, Y Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org

In this paper, we focus on the problem of Medical Visual Question Answering (MedVQA),
which is crucial in efficiently interpreting medical images with vital clinic-relevant information …

被引用次数：171 相关文章所有 2 个版本

[PDF] ieee.org

Meta-learning in neural networks: A survey

T Hospedales, A Antoniou, P Micaelli… - IEEE transactions on …, 2021 - ieeexplore.ieee.org

The field of meta-learning, or learning-to-learn, has seen a dramatic rise in interest in recent
years. Contrary to conventional approaches to AI where tasks are solved from scratch using …

被引用次数：2481 相关文章所有 10 个版本

[PDF] arxiv.org

Pmc-clip: Contrastive language-image pre-training using biomedical documents

W Lin, Z Zhao, X Zhang, C Wu, Y Zhang… - … Conference on Medical …, 2023 - Springer

Foundation models trained on large-scale dataset gain a recent surge in CV and NLP. In
contrast, development in biomedical domain lags far behind due to data scarcity. To address …

被引用次数：128 相关文章所有 6 个版本

[PDF] arxiv.org

Multi-modal masked autoencoders for medical vision-and-language pre-training

Z Chen, Y Du, J Hu, Y Liu, G Li, X Wan… - … Conference on Medical …, 2022 - Springer

Medical vision-and-language pre-training provides a feasible solution to extract effective
vision-and-language representations from medical images and texts. However, few studies …

被引用次数：122 相关文章所有 4 个版本

[PDF] arxiv.org

Slake: A semantically-labeled knowledge-enhanced dataset for medical visual question answering

B Liu, LM Zhan, L Xu, L Ma, Y Yang… - 2021 IEEE 18th …, 2021 - ieeexplore.ieee.org

Medical visual question answering (Med-VQA) has tremendous potential in healthcare.
However, the development of this technology is hindered by the lacking of publicly-available …

被引用次数：247 相关文章所有 7 个版本

[PDF] arxiv.org

Multi-modal understanding and generation for medical images and text via vision-language pre-training

JH Moon, H Lee, W Shin, YH Kim… - IEEE Journal of …, 2022 - ieeexplore.ieee.org

Recently a number of studies demonstrated impressive performance on diverse vision-
language multi-modal tasks such as image captioning and visual question answering by …

被引用次数：161 相关文章所有 9 个版本

[PDF] arxiv.org

Mmbert: Multimodal bert pretraining for improved medical vqa

Y Khare, V Bagal, M Mathew, A Devi… - 2021 IEEE 18th …, 2021 - ieeexplore.ieee.org

Images in the medical domain are fundamentally different from the general domain images.
Consequently, it is infeasible to directly employ general domain Visual Question Answering …

被引用次数：139 相关文章所有 7 个版本

[PDF] arxiv.org

Align, reason and learn: Enhancing medical vision-and-language pre-training with knowledge

Z Chen, G Li, X Wan - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

Medical vision-and-language pre-training (Med-VLP) has received considerable attention
owing to its applicability to extracting generic vision-and-language representations from …

被引用次数：62 相关文章所有 4 个版本

高级搜索

QQ 群