[PDF][PDF] Benchmarking and Analyzing Bird's Eye View Perception Robustness to Corruptions

S Xie, L Kong, W Zhang, J Ren, L Pan, K Chen, Z Liu - researchgate.net
Recent advancements in bird's eye view (BEV) representations have shown remarkable
promise for in-vehicle 3D perception. However, while these methods have achieved …

Debiased-CAM to mitigate image perturbations with faithful visual explanations of machine learning

W Zhang, M Dimiccoli, BY Lim - … of the 2022 CHI Conference on Human …, 2022 - dl.acm.org
Model explanations such as saliency maps can improve user trust in AI by highlighting
important features for a prediction. However, these become distorted and misleading when …

Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning

B Zhao, Y Zong, L Zhang, T Hospedales - arXiv preprint arXiv:2406.12742, 2024 - arxiv.org
The advancement of large language models (LLMs) has significantly broadened the scope
of applications in natural language processing, with multi-modal LLMs extending these …

Managing the risks of inevitably biased visual artificial intelligence systems

A Caliskan, R Steed - 2022 - policycommons.net
Scientists have long been developing machines that attempt to imitate the human brain. Just
as humans are exposed to systemic injustices, machines learn human-like stereotypes and …

[HTML][HTML] Human attention guided explainable artificial intelligence for computer vision models

G Liu, J Zhang, AB Chan, JH Hsiao - Neural Networks, 2024 - Elsevier
Explainable artificial intelligence (XAI) has been increasingly investigated to enhance the
transparency of black-box artificial intelligence models, promoting better user understanding …

Analyzing and mitigating object hallucination in large vision-language models

Y Zhou, C Cui, J Yoon, L Zhang, Z Deng, C Finn… - arXiv preprint arXiv …, 2023 - arxiv.org
Large vision-language models (LVLMs) have shown remarkable abilities in understanding
visual information with human languages. However, LVLMs still suffer from object …

Towards Interpretable and Fair Computer Vision

N Meister - 2022 - dataspace.princeton.edu
As machine learning and computer vision are increasingly applied to high-impact, high-risk
domains, there have been numerous new methods aimed at making AI models more human …

Learning GAN-Based Foveated Reconstruction to Recover Perceptually Important Image Features

L Surace, M Wernikowski, C Tursun… - ACM transactions on …, 2023 - dl.acm.org
A foveated image can be entirely reconstructed from a sparse set of samples distributed
according to the retinal sensitivity of the human visual system, which rapidly decreases with …

Wh-AI-les: Exploring harmonized vision models robustness against distribution shift

M Mounsif, M Benabdelkrim… - 2023 IEEE 13th …, 2023 - ieeexplore.ieee.org
The remarkable and increasing efficiency of learning-based vision strategies has induced
strong paradigm shift in favor of neural architectures that are consequently finding their way …

Deep Learning for Computer Vision: Recent Breakthroughs and Emerging Trends

SS Ittannavar, BP Khot… - … and Smart Electrical …, 2023 - ieeexplore.ieee.org
Significant strides have been achieved in the use of deep learning to computer vision, which
has changed the way that computers process and respond to visual data. The authors of this …