CUNSB-RFIE: Context-aware Unpaired Neural Schr\" odinger Bridge in Retinal Fundus Image Enhancement

X Dong, VK Vasa, W Zhu, P Qiu, X Chen, Y Su… - arXiv preprint arXiv …, 2024 - arxiv.org
Retinal fundus photography is significant in diagnosing and monitoring retinal diseases.
However, systemic imperfections and operator/patient-related factors can hinder the …

Advanced Visual and Textual Co-context Aware Attention Network with Dependent Multimodal Fusion Block for Visual Question Answering

HS Asri, R Safabakhsh - Multimedia Tools and Applications, 2024 - Springer
Visual question answering (VQA) is a multimodal task requiring a simultaneous
understanding of both visual and textual content. Therefore, image and question …

Multiscale Latent Diffusion Model for Enhanced Feature Extraction from Medical Images

RT Sadia, J Zhang, J Chen - arXiv preprint arXiv:2410.04000, 2024 - arxiv.org
Various imaging modalities are used in patient diagnosis, each offering unique advantages
and valuable insights into anatomy and pathology. Computed Tomography (CT) is crucial in …