Transparency by design: Closing the gap between performance and interpretability in visual reasoning

D Mascharka, P Tran, R Soklaski… - Proceedings of the …, 2018 - openaccess.thecvf.com
Visual question answering requires high-order reasoning about an image, which is a
fundamental capability needed by machine systems to follow complex directives. Recently …

Clevr: A diagnostic dataset for compositional language and elementary visual reasoning

J Johnson, B Hariharan… - Proceedings of the …, 2017 - openaccess.thecvf.com
When building artificial intelligence systems that can reason and answer questions about
visual data, we need diagnostic tests to analyze our progress and discover short-comings …

Iterative visual reasoning beyond convolutions

X Chen, LJ Li, L Fei-Fei… - Proceedings of the IEEE …, 2018 - openaccess.thecvf.com
We present a novel framework for iterative visual reasoning. Our framework goes beyond
current recognition systems that lack the capability to reason beyond stack of convolutions …

Raven: A dataset for relational and analogical visual reasoning

C Zhang, F Gao, B Jia, Y Zhu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com
Dramatic progress has been witnessed in basic vision tasks involving low-level perception,
such as object recognition, detection, and tracking. Unfortunately, there is still enormous …

Explainable and explicit visual reasoning over scene graphs

J Shi, H Zhang, J Li - … of the IEEE/CVF conference on …, 2019 - openaccess.thecvf.com
We aim to dismantle the prevalent black-box neural architectures used in complex visual
reasoning tasks, into the proposed eXplainable and eXplicit Neural Modules (XNMs), which …

Explicit knowledge incorporation for visual reasoning

Y Zhang, M Jiang, Q Zhao - … of the IEEE/CVF conference on …, 2021 - openaccess.thecvf.com
Existing explainable and explicit visual reasoning methods only perform reasoning based
on visual evidence but do not take into account knowledge beyond what is in the visual …

Inferring and executing programs for visual reasoning

J Johnson, B Hariharan… - Proceedings of the …, 2017 - openaccess.thecvf.com
Existing methods for visual reasoning attempt to directly map inputs to outputs using black-
box architectures without explicitly modeling the underlying reasoning processes. As a …

A peek into the reasoning of neural networks: Interpreting with structural visual concepts

Y Ge, Y Xiao, Z Xu, M Zheng… - Proceedings of the …, 2021 - openaccess.thecvf.com
Despite substantial progress in applying neural networks (NN) to a wide variety of areas,
they still largely suffer from a lack of transparency and interpretability. While recent …

Neural module networks

J Andreas, M Rohrbach, T Darrell… - Proceedings of the …, 2016 - openaccess.thecvf.com
Visual question answering is fundamentally compositional in nature---a question like" where
is the dog?" shares substructure with questions like" what color is the dog?" and" where is …

Learning conditioned graph structures for interpretable visual question answering

W Norcliffe-Brown, S Vafeias… - Advances in neural …, 2018 - proceedings.neurips.cc
Visual Question answering is a challenging problem requiring a combination of concepts
from Computer Vision and Natural Language Processing. Most existing approaches use a …