DeViDe: Faceted medical knowledge for improved medical vision-language pre-training

H Luo, Z Zhou, C Royer, A Sekuboyina… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-language pre-training for chest X-rays has made significant strides, primarily by
utilizing paired radiographs and radiology reports. However, existing approaches often face …

Imitate: Clinical prior guided hierarchical vision-language pre-training

C Liu, S Cheng, M Shi, A Shah, W Bai… - IEEE Transactions on …, 2024 - ieeexplore.ieee.org
In the field of medical Vision-Language Pretraining (VLP), significant efforts have been
devoted to deriving text and image features from both clinical reports and associated …

Xlip: Cross-modal attention masked modelling for medical language-image pre-training

B Wu, Y Xie, Z Zhang, MH Phan, Q Chen… - arXiv preprint arXiv …, 2024 - arxiv.org
Vision-and-language pretraining (VLP) in the medical field utilizes contrastive learning on
image-text pairs to achieve effective transfer across tasks. Yet, current VLP approaches with …
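The "contrastive learning on image-text pairs" mentioned above is the common CLIP-style recipe shared by many of the works in this list. The following is a minimal, generic sketch of a symmetric InfoNCE image-text contrastive loss in PyTorch; it is an illustration of the general technique, not the specific method of XLIP or any other paper here, and the tensor shapes and temperature value are assumptions.

    import torch
    import torch.nn.functional as F

    def image_text_contrastive_loss(image_emb: torch.Tensor,
                                    text_emb: torch.Tensor,
                                    temperature: float = 0.07) -> torch.Tensor:
        """Symmetric InfoNCE loss over a batch of paired image/report embeddings.

        image_emb, text_emb: (batch, dim) tensors where row i of each is a matched pair.
        """
        # L2-normalize so the dot product becomes a cosine similarity
        image_emb = F.normalize(image_emb, dim=-1)
        text_emb = F.normalize(text_emb, dim=-1)

        # (batch, batch) similarity matrix; the diagonal holds the true pairs
        logits = image_emb @ text_emb.t() / temperature
        targets = torch.arange(logits.size(0), device=logits.device)

        # Cross-entropy in both directions: image-to-text and text-to-image
        loss_i2t = F.cross_entropy(logits, targets)
        loss_t2i = F.cross_entropy(logits.t(), targets)
        return (loss_i2t + loss_t2i) / 2

    # Toy usage with random embeddings standing in for encoder outputs
    images = torch.randn(8, 512)
    reports = torch.randn(8, 512)
    print(image_text_contrastive_loss(images, reports).item())

In practice the two embedding matrices would come from a vision encoder applied to radiographs and a text encoder applied to the paired reports.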

Grounded Knowledge-Enhanced Medical VLP for Chest X-Ray

Q Deng, Z Huang, Y Wang, Z Wang, Z Wang… - arXiv preprint arXiv …, 2024 - arxiv.org
Medical vision-language pre-training has emerged as a promising approach for learning
domain-general representations of medical images and text. Current algorithms that exploit …

Medical image understanding with pretrained vision language models: A comprehensive study

Z Qin, H Yi, Q Lao, K Li - arXiv preprint arXiv:2209.15517, 2022 - arxiv.org
The large-scale pre-trained vision language models (VLM) have shown remarkable domain
transfer capability on natural images. However, it remains unknown whether this capability …

Knowledge-enhanced visual-language pre-training on chest radiology images

X Zhang, C Wu, Y Zhang, W Xie, Y Wang - Nature Communications, 2023 - nature.com
While multi-modal foundation models pre-trained on large-scale data have been successful
in natural language understanding and vision recognition, their use in medical domains is …

Freeze the Backbones: a Parameter-Efficient Contrastive Approach to Robust Medical Vision-Language Pre-Training

J Qin, C Liu, S Cheng, Y Guo… - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org
Modern healthcare often utilises radiographic images alongside textual reports for
diagnostics, encouraging the use of Vision-Language Self-Supervised Learning (VL-SSL) …
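The parameter-efficient idea named in this title (keeping pre-trained backbones frozen and training only small heads) can be sketched as follows. This is a generic, assumed setup: the linear "backbones", dimensions, and placeholder alignment loss are stand-ins, not the authors' architecture; in practice the frozen models would be, e.g., a pre-trained vision transformer and a clinical BERT, trained with a contrastive loss like the one sketched earlier.

    import torch
    import torch.nn as nn

    # Hypothetical stand-ins for pre-trained vision and text backbones
    vision_backbone = nn.Linear(1024, 768)   # pretend image-feature extractor
    text_backbone = nn.Linear(512, 768)      # pretend report-feature extractor

    # Freeze both backbones so only the small projection heads receive gradients
    for backbone in (vision_backbone, text_backbone):
        backbone.requires_grad_(False)

    # Trainable projection heads mapping both modalities into a shared space
    image_proj = nn.Linear(768, 256)
    text_proj = nn.Linear(768, 256)

    optimizer = torch.optim.AdamW(
        list(image_proj.parameters()) + list(text_proj.parameters()), lr=1e-4
    )

    # One illustrative step: features from the frozen backbones, gradients only for the heads
    img_feat = vision_backbone(torch.randn(8, 1024))
    txt_feat = text_backbone(torch.randn(8, 512))
    loss = nn.functional.mse_loss(image_proj(img_feat), text_proj(txt_feat))  # placeholder alignment loss
    loss.backward()
    optimizer.step()

Because the backbone parameters never enter the optimizer, memory and compute per update scale with the small projection heads rather than the full encoders.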

Utilizing synthetic data for medical vision-language pre-training: Bypassing the need for real images

C Liu, A Shah, W Bai, R Arcucci - arXiv preprint arXiv:2310.07027, 2023 - arxiv.org
Medical Vision-Language Pre-training (VLP) learns representations jointly from medical
images and paired radiology reports. It typically requires large-scale paired image-text …

M-flag: Medical vision-language pre-training with frozen language models and latent space geometry optimization

C Liu, S Cheng, C Chen, M Qiao, W Zhang… - … Conference on Medical …, 2023 - Springer
Medical vision-language models enable co-learning and integrating features from medical
imaging and clinical text. However, these models are not easy to train and the latent …

MOSMOS: Multi-organ segmentation facilitated by medical report supervision

W Tian, X Huang, J Hou, C Ren, L Jiang… - arXiv preprint arXiv …, 2024 - arxiv.org
Owing to the large amount of multi-modal data in modern medical systems, such as medical
images and reports, Medical Vision-Language Pre-training (Med-VLP) has demonstrated …