Chexagent: Towards a foundation model for chest x-ray interpretation

Z Chen, M Varma, JB Delbrouck, M Paschali… - arXiv preprint arXiv …, 2024 - arxiv.org
Chest X-rays (CXRs) are the most frequently performed imaging test in clinical practice.
Recent advances in the development of vision-language foundation models (FMs) give rise …

RaDialog: A large vision-language model for radiology report generation and conversational assistance

C Pellegrini, E Özsoy, B Busam, N Navab… - arXiv preprint arXiv …, 2023 - arxiv.org
Conversational AI tools that can generate and discuss clinically correct radiology reports for
a given medical image have the potential to transform radiology. Such a human-in-the-loop …

[HTML][HTML] A systematic evaluation of gpt-4v's multimodal capability for chest x-ray image analysis

Y Liu, Y Li, Z Wang, X Liang, L Liu, L Wang, L Cui, Z Tu… - Meta-Radiology, 2024 - Elsevier
This work evaluates GPT-4V's multimodal capability for medical image analysis, focusing on
three representative tasks radiology report generation, medical visual question answering …

Preference Fine-Tuning for Factuality in Chest X-Ray Interpretation Models Without Human Feedback

D Hein, Z Chen, S Ostmeier, J Xu, M Varma… - arXiv preprint arXiv …, 2024 - arxiv.org
Radiologists play a crucial role by translating medical images into medical reports. However,
the field faces staffing shortages and increasing workloads. While automated approaches …

A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data

X Wang, G Figueredo, R Li, WE Zhang, W Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Automatic radiology report generation can alleviate the workload for physicians and
minimize regional disparities in medical resources, therefore becoming an important topic in …

Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering

Z Zhang, J Wang, R Zhu, X Gong - arXiv preprint arXiv:2410.21000, 2024 - arxiv.org
Medical Visual Question Answering (MedVQA) has gained increasing attention at the
intersection of computer vision and natural language processing. Its capability to interpret …

Enhancing Medical VQA with Self-Attention Based Multi-Model Approach

V Sakthivel, GB Mohan… - 2024 15th International …, 2024 - ieeexplore.ieee.org
In the medical and healthcare fields, the integration of clinical images and question-
answering systems is believed to be a powerful tool that has the capacity to change the …

Context-Guided Medical Visual Question Answering

W Arsalane, P Chikontwe, M Luna, M Kang… - MICCAI Student Board … - openreview.net
Given a medical image and a question in natural language, medical VQA systems are
required to predict clinically relevant answers. Integrating information from visual and textual …

[引用][C] GRADUATION THESIS

TQ Duc, TTK Thanh