Rad-restruct: A novel vqa benchmark and method for structured radiology reporting

Z Chen, M Varma, JB Delbrouck, M Paschali… - arXiv preprint arXiv …, 2024 - arxiv.org

Chest X-rays (CXRs) are the most frequently performed imaging test in clinical practice.
Recent advances in the development of vision-language foundation models (FMs) give rise …

被引用次数：35 相关文章所有 3 个版本

[PDF] arxiv.org

RaDialog: A large vision-language model for radiology report generation and conversational assistance

C Pellegrini, E Özsoy, B Busam, N Navab… - arXiv preprint arXiv …, 2023 - arxiv.org

Conversational AI tools that can generate and discuss clinically correct radiology reports for
a given medical image have the potential to transform radiology. Such a human-in-the-loop …

被引用次数：21 相关文章所有 2 个版本

[HTML] sciencedirect.com

[HTML][HTML] A systematic evaluation of gpt-4v's multimodal capability for chest x-ray image analysis

Y Liu, Y Li, Z Wang, X Liang, L Liu, L Wang, L Cui, Z Tu… - Meta-Radiology, 2024 - Elsevier

This work evaluates GPT-4V's multimodal capability for medical image analysis, focusing on
three representative tasks radiology report generation, medical visual question answering …

被引用次数：3 相关文章

[PDF] arxiv.org

Preference Fine-Tuning for Factuality in Chest X-Ray Interpretation Models Without Human Feedback

D Hein, Z Chen, S Ostmeier, J Xu, M Varma… - arXiv preprint arXiv …, 2024 - arxiv.org

Radiologists play a crucial role by translating medical images into medical reports. However,
the field faces staffing shortages and increasing workloads. While automated approaches …

A Survey of Deep Learning-based Radiology Report Generation Using Multimodal Data

X Wang, G Figueredo, R Li, WE Zhang, W Chen… - arXiv preprint arXiv …, 2024 - arxiv.org

Automatic radiology report generation can alleviate the workload for physicians and
minimize regional disparities in medical resources, therefore becoming an important topic in …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Efficient Bilinear Attention-based Fusion for Medical Visual Question Answering

Z Zhang, J Wang, R Zhu, X Gong - arXiv preprint arXiv:2410.21000, 2024 - arxiv.org

Medical Visual Question Answering (MedVQA) has gained increasing attention at the
intersection of computer vision and natural language processing. Its capability to interpret …

Enhancing Medical VQA with Self-Attention Based Multi-Model Approach

V Sakthivel, GB Mohan… - 2024 15th International …, 2024 - ieeexplore.ieee.org

In the medical and healthcare fields, the integration of clinical images and question-
answering systems is believed to be a powerful tool that has the capacity to change the …

[PDF] openreview.net

Context-Guided Medical Visual Question Answering

W Arsalane, P Chikontwe, M Luna, M Kang… - MICCAI Student Board … - openreview.net

Given a medical image and a question in natural language, medical VQA systems are
required to predict clinically relevant answers. Integrating information from visual and textual …

[引用][C] GRADUATION THESIS

TQ Duc, TTK Thanh

高级搜索

QQ 群