Grad-CAM: visual explanations from deep networks via gradient-based localization

Application of explainable artificial intelligence for healthcare: A systematic review of the last decade (2011–2022)

HW Loh, CP Ooi, S Seoni, PD Barua, F Molinari… - Computer Methods and …, 2022 - Elsevier

Background and objectives Artificial intelligence (AI) has branched out to various
applications in healthcare, such as health services management, predictive medicine …

被引用次数：482 相关文章所有 8 个版本

[PDF] peerj.com

The multi-modal fusion in visual question answering: a review of attention mechanisms

S Lu, M Liu, L Yin, Z Yin, X Liu, W Zheng - PeerJ Computer Science, 2023 - peerj.com

Abstract Visual Question Answering (VQA) is a significant cross-disciplinary issue in the
fields of computer vision and natural language processing that requires a computer to output …

被引用次数：212 相关文章所有 8 个版本

[PDF] thecvf.com

Vipergpt: Visual inference via python execution for reasoning

D Surís, S Menon, C Vondrick - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Answering visual queries is a complex task that requires both visual processing and
reasoning. End-to-end models, the dominant approach for this task, do not explicitly …

被引用次数：401 相关文章所有 6 个版本

[PDF] 101.43.54.196

Obtaining genetics insights from deep learning via explainable artificial intelligence

G Novakovsky, N Dexter, MW Libbrecht… - Nature Reviews …, 2023 - nature.com

Artificial intelligence (AI) models based on deep learning now represent the state of the art
for making functional predictions in genomics research. However, the underlying basis on …

被引用次数：247 相关文章所有 4 个版本

[PDF] arxiv.org

Visual language maps for robot navigation

C Huang, O Mees, A Zeng… - 2023 IEEE International …, 2023 - ieeexplore.ieee.org

Grounding language to the visual observations of a navigating agent can be performed
using off-the-shelf visual-language models pretrained on Internet-scale data (eg, image …

被引用次数：355 相关文章所有 4 个版本

[PDF] arxiv.org

Visual classification via description from large language models

S Menon, C Vondrick - arXiv preprint arXiv:2210.07183, 2022 - arxiv.org

Vision-language models (VLMs) such as CLIP have shown promising performance on a
variety of recognition tasks using the standard zero-shot classification procedure--computing …

被引用次数：332 相关文章所有 3 个版本

[PDF] springer.com

Artificial intelligence for waste management in smart cities: a review

B Fang, J Yu, Z Chen, AI Osman, M Farghali… - Environmental …, 2023 - Springer

The rising amount of waste generated worldwide is inducing issues of pollution, waste
management, and recycling, calling for new strategies to improve the waste ecosystem, such …

被引用次数：207 相关文章所有 14 个版本

Explainable artificial intelligence: a comprehensive review

D Minh, HX Wang, YF Li, TN Nguyen - Artificial Intelligence Review, 2022 - Springer

Thanks to the exponential growth in computing power and vast amounts of data, artificial
intelligence (AI) has witnessed remarkable developments in recent years, enabling it to be …

被引用次数：578 相关文章所有 4 个版本

[PDF] arxiv.org

Uniformer: Unifying convolution and self-attention for visual recognition

K Li, Y Wang, J Zhang, P Gao, G Song… - … on Pattern Analysis …, 2023 - ieeexplore.ieee.org

It is a challenging task to learn discriminative representation from images and videos, due to
large local redundancy and complex global dependency in these visual data. Convolution …

被引用次数：396 相关文章所有 6 个版本

[PDF] rsna.org Full View

RadImageNet: an open radiologic deep learning research dataset for effective transfer learning

X Mei, Z Liu, PM Robson, B Marinelli… - Radiology: Artificial …, 2022 - pubs.rsna.org

Purpose To demonstrate the value of pretraining with millions of radiologic images
compared with ImageNet photographic images on downstream medical applications when …

被引用次数：223 相关文章所有 5 个版本

高级搜索

QQ 群