D Mascharka, P Tran, R Soklaski… - arXiv e-prints, 2018 - ui.adsabs.harvard.edu
Visual question answering requires high-order reasoning about an image, which is a
fundamental capability needed by machine systems to follow complex directives. Recently …