Ask your neurons: A neural-based approach to answering questions about images

Authors

Mateusz Malinowski, Marcus Rohrbach, Mario Fritz

Publication date

2015

Conference

Proceedings of the IEEE international conference on computer vision

Pages

1-9

Description

We address a question answering task on real-world images that is set up as a Visual Turing Test. By combining latest advances in image representation and natural language processing, we propose Neural-Image-QA, an end-to-end formulation to this problem for which all parts are trained jointly. In contrast to previous efforts, we are facing a multi-modal problem where the language output (answer) is conditioned on visual and natural language input (image and question). Our approach Neural-Image-QA doubles the performance of the previous best approach on this problem. We provide additional insights into the problem by analyzing how much information is contained only in the language part for which we provide a new human baseline. To study human consensus, which is related to the ambiguities inherent in this challenging task, we propose two novel metrics and collect additional answers which extends the original DAQUAR dataset to DAQUAR-Consensus.

Total citations

Cited by 773

Citations per year: 2015: 17, 2016: 65, 2017: 113, 2018: 136, 2019: 107, 2020: 85, 2021: 70, 2022: 69, 2023: 49, 2024: 32
