查看文章

MED-GPVS: A deep learning-based joint biomedical image classification and visual question answering system for precision e-health

作者

Harishma T Haridas, Mostafa M Fouda, Zubair Md Fadlullah, Mohamed Mahmoud, Basem M ElHalawany, Mohsen Guizani

发表日期

2022/5/16

研讨会论文

ICC 2022-IEEE International Conference on Communications

页码范围

3838-3843

出版商

IEEE

简介

General Purpose Vision System (GPVS) is a task-agnostic vision-language system that inputs an image and a question from which the system recognizes the tasks to be performed and outputs bounding boxes, confidence scores, and text outputs to answer the question. While much attention to GPVS has been recently given in the computer vision field, its medical field applications are still in their infancy. This paper presents MED-GPVS, a customized deep learning-based GPVS on biomedical images to perform various vision tasks, such as object detection and visual question answering, on medical images to facilitate precision medicine/e-health services. Our envisioned MED-GPVS takes an image and a natural language text as inputs, and then outputs bounding boxes, confidence scores, and generates a caption (i.e., the answer to the posed query). For example, if a medical image of a patient’s abdomen is …

引用总数

被引用次数：6

202320245 1

学术搜索中的文章

MED-GPVS: A deep learning-based joint biomedical image classification and visual question answering system for precision e-health

HT Haridas, MM Fouda, ZM Fadlullah, M Mahmoud… - ICC 2022-IEEE International Conference on …, 2022

被引用次数：6 相关文章所有 2 个版本