Authors
Harishma T Haridas, Mostafa M Fouda, Zubair Md Fadlullah, Mohamed Mahmoud, Basem M ElHalawany, Mohsen Guizani
Publication date
2022/5/16
Conference
ICC 2022 - IEEE International Conference on Communications
Pages
3838-3843
Publisher
IEEE
Description
A General Purpose Vision System (GPVS) is a task-agnostic vision-language system that takes an image and a question as input, recognizes from them the task to be performed, and outputs bounding boxes, confidence scores, and text that answers the question. While GPVS has recently received much attention in the computer vision field, its medical applications are still in their infancy. This paper presents MED-GPVS, a customized deep learning-based GPVS that performs various vision tasks, such as object detection and visual question answering, on biomedical images to facilitate precision medicine and e-health services. Our envisioned MED-GPVS takes an image and a natural language text as inputs, then outputs bounding boxes and confidence scores and generates a caption (i.e., the answer to the posed query). For example, if a medical image of a patient’s abdomen is …