相关文章- 学术资源搜索

[PDF][PDF] Benchmarking and Analyzing Bird's Eye View Perception Robustness to Corruptions

S Xie, L Kong, W Zhang, J Ren, L Pan, K Chen, Z Liu - researchgate.net

Recent advancements in bird's eye view (BEV) representations have shown remarkable
promise for in-vehicle 3D perception. However, while these methods have achieved …

Debiased-CAM to mitigate image perturbations with faithful visual explanations of machine learning

W Zhang, M Dimiccoli, BY Lim - … of the 2022 CHI Conference on Human …, 2022 - dl.acm.org

Model explanations such as saliency maps can improve user trust in AI by highlighting
important features for a prediction. However, these become distorted and misleading when …

被引用次数：16 相关文章所有 9 个版本

[PDF] arxiv.org

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

B Zhao, Y Zong, L Zhang, T Hospedales - arXiv preprint arXiv:2406.12742, 2024 - arxiv.org

The advancement of large language models (LLMs) has significantly broadened the scope
of applications in natural language processing, with multi-modal LLMs extending these …

Managing the risks of inevitably biased visual artificial intelligence systems

A Caliskan, R Steed - 2022 - policycommons.net

Scientists have long been developing machines that attempt to imitate the human brain. Just
as humans are exposed to systemic injustices, machines learn human-like stereotypes and …

被引用次数：1 相关文章

[HTML] sciencedirect.com

[HTML][HTML] Human attention guided explainable artificial intelligence for computer vision models

G Liu, J Zhang, AB Chan, JH Hsiao - Neural Networks, 2024 - Elsevier

Explainable artificial intelligence (XAI) has been increasingly investigated to enhance the
transparency of black-box artificial intelligence models, promoting better user understanding …

被引用次数：5 相关文章所有 5 个版本

[PDF] arxiv.org

Analyzing and mitigating object hallucination in large vision-language models

Y Zhou, C Cui, J Yoon, L Zhang, Z Deng, C Finn… - arXiv preprint arXiv …, 2023 - arxiv.org

Large vision-language models (LVLMs) have shown remarkable abilities in understanding
visual information with human languages. However, LVLMs still suffer from object …

被引用次数：77 相关文章所有 4 个版本

Towards Interpretable and Fair Computer Vision

N Meister - 2022 - dataspace.princeton.edu

As machine learning and computer vision are increasingly applied to high-impact, high-risk
domains, there have been numerous new methods aimed at making AI models more human …

[PDF] acm.org

Learning GAN-Based Foveated Reconstruction to Recover Perceptually Important Image Features

L Surace, M Wernikowski, C Tursun… - ACM transactions on …, 2023 - dl.acm.org

A foveated image can be entirely reconstructed from a sparse set of samples distributed
according to the retinal sensitivity of the human visual system, which rapidly decreases with …

被引用次数：2 相关文章所有 5 个版本

Wh-AI-les: Exploring harmonized vision models robustness against distribution shift

M Mounsif, M Benabdelkrim… - 2023 IEEE 13th …, 2023 - ieeexplore.ieee.org

The remarkable and increasing efficiency of learning-based vision strategies has induced
strong paradigm shift in favor of neural architectures that are consequently finding their way …

Deep Learning for Computer Vision: Recent Breakthroughs and Emerging Trends

SS Ittannavar, BP Khot… - … and Smart Electrical …, 2023 - ieeexplore.ieee.org

Significant strides have been achieved in the use of deep learning to computer vision, which
has changed the way that computers process and respond to visual data. The authors of this …

高级搜索

QQ 群