Chexagent: Towards a foundation model for chest x-ray interpretation

MC Ramos, CJ Collison, AD White - arXiv preprint arXiv:2407.01603, 2024 - arxiv.org

Large language models (LLMs) are emerging as a powerful tool in chemistry across multiple
domains. In chemistry, LLMs are able to accurately predict properties, design new …

[HTML] mdpi.com

[HTML][HTML] Contrastive learning penalized cross-entropy with diversity contrastive search decoding for diagnostic report generation of reduced token repetition

T Zhang, J Meng, Y Yang, S Yu - Applied Sciences, 2024 - mdpi.com

Medical imaging description and disease diagnosis are vitally important yet time-consuming.
Automated diagnosis report generation (DRG) from medical imaging description can reduce …

被引用次数：1 相关文章所有 2 个版本

Token-Mixer: Bind Image and Text in One Embedding Space for Medical Image Reporting

Y Yang, J Yu, Z Fu, K Zhang, T Yu… - … on Medical Imaging, 2024 - ieeexplore.ieee.org

Medical image reporting focused on automatically generating the diagnostic reports from
medical images has garnered growing research attention. In this task, learning cross-modal …

MAIRA-2: Grounded Radiology Report Generation

S Bannur, K Bouzid, DC Castro, A Schwaighofer… - arXiv preprint arXiv …, 2024 - arxiv.org

Radiology reporting is a complex task that requires detailed image understanding,
integration of multiple inputs, including comparison with prior imaging, and precise …

相关文章所有 2 个版本

[PDF] arxiv.org

Merlin: A Vision Language Foundation Model for 3D Computed Tomography

L Blankemeier, JP Cohen, A Kumar… - arXiv preprint arXiv …, 2024 - arxiv.org

Over 85 million computed tomography (CT) scans are performed annually in the US, of
which approximately one quarter focus on the abdomen. Given the current radiologist …

被引用次数：1 相关文章所有 2 个版本

[PDF] arxiv.org

Cross-Modality Jailbreak and Mismatched Attacks on Medical Multimodal Large Language Models

X Huang, X Wang, H Zhang, J Xi, J An, H Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Security concerns related to Large Language Models (LLMs) have been extensively
explored, yet the safety implications for Multimodal Large Language Models (MLLMs) …

相关文章所有 2 个版本

[PDF] arxiv.org

GREEN: Generative Radiology Report Evaluation and Error Notation

S Ostmeier, J Xu, Z Chen, M Varma… - arXiv preprint arXiv …, 2024 - arxiv.org

Evaluating radiology reports is a challenging problem as factual correctness is extremely
important due to the need for accurate medical communication about medical images …

相关文章所有 2 个版本

[PDF] arxiv.org

CheXpert Plus: Hundreds of Thousands of Aligned Radiology Texts, Images and Patients

P Chambon, JB Delbrouck, T Sounack… - arXiv preprint arXiv …, 2024 - arxiv.org

Since the release of the original CheXpert paper five years ago, CheXpert has become one
of the most widely used and cited clinical AI datasets. The emergence of vision language …

相关文章所有 2 个版本

[PDF] arxiv.org

Inquire, Interact, and Integrate: A Proactive Agent Collaborative Framework for Zero-Shot Multimodal Medical Reasoning

Z Gu, F Liu, C Yin, P Zhang - arXiv preprint arXiv:2405.11640, 2024 - arxiv.org

The adoption of large language models (LLMs) in healthcare has attracted significant
research interest. However, their performance in healthcare remains under-investigated and …

From Generalist to Specialist: Incorporating Domain-Knowledge into Flamingo for Chest X-Ray Report Generation

R Stock, S Denner, Y Kirchhoff, C Ulrich… - Medical Imaging with …, 2024 - openreview.net

Automating the generation of accurate and reliable radiological reports from chest X-ray
images represents a significant challenge in medical image computing. In this context …

高级搜索

QQ 群