Grad-cam: Visual explanations from deep networks via gradient-based localization RR Selvaraju, M Cogswell, A Das, R Vedantam, D Parikh, D Batra Proceedings of the IEEE international conference on computer vision, 618-626, 2017 | 22441 | 2017 |
VQA: Visual Question Answering A Agrawal, J Lu, S Antol, M Mitchell, CL Zitnick, D Parikh, D Batra International Journal of Computer Vision, 1-28, 2015 | 5901* | 2015 |
Cider: Consensus-based image description evaluation R Vedantam, C Lawrence Zitnick, D Parikh Proceedings of the IEEE conference on computer vision and pattern …, 2015 | 4663 | 2015 |
Vilbert: Pretraining task-agnostic visiolinguistic representations for vision-and-language tasks J Lu, D Batra, D Parikh, S Lee Advances in neural information processing systems 32, 2019 | 3499 | 2019 |
Making the V in VQA Matter: Elevating the Role of Image Understanding in Visual Question Answering Y Goyal, T Khot, D Summers-Stay, D Batra, D Parikh arXiv preprint arXiv:1612.00837, 2016 | 2761 | 2016 |
Hierarchical question-image co-attention for visual question answering J Lu, J Yang, D Batra, D Parikh Advances in neural information processing systems 29, 2016 | 1945 | 2016 |
Knowing when to look: Adaptive attention via a visual sentinel for image captioning J Lu, C Xiong, D Parikh, R Socher Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 1766 | 2017 |
Habitat: A platform for embodied ai research M Savva, A Kadian, O Maksymets, Y Zhao, E Wijmans, B Jain, J Straub, ... Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 1285 | 2019 |
Relative attributes D Parikh, K Grauman 2011 International Conference on Computer Vision, 503-510, 2011 | 1221 | 2011 |
Visual dialog A Das, S Kottur, K Gupta, A Singh, D Yadav, JMF Moura, D Parikh, ... Proceedings of the IEEE conference on computer vision and pattern …, 2017 | 1106 | 2017 |
Joint unsupervised learning of deep representations and image clusters J Yang, D Parikh, D Batra Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 961 | 2016 |
Graph r-cnn for scene graph generation J Yang, J Lu, S Lee, D Batra, D Parikh Proceedings of the European conference on computer vision (ECCV), 670-685, 2018 | 932 | 2018 |
A corpus and cloze evaluation for deeper understanding of commonsense stories N Mostafazadeh, N Chambers, X He, D Parikh, D Batra, L Vanderwende, ... Proceedings of NAACL HLT, San Diego, California, June. Association for …, 2016 | 863* | 2016 |
Make-a-video: Text-to-video generation without text-video data U Singer, A Polyak, T Hayes, X Yin, J An, S Zhang, Q Hu, H Yang, ... arXiv preprint arXiv:2209.14792, 2022 | 763 | 2022 |
Embodied question answering A Das, S Datta, G Gkioxari, S Lee, D Parikh, D Batra Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 677 | 2018 |
Don't just assume; look and answer: Overcoming priors for visual question answering A Agrawal, D Batra, D Parikh, A Kembhavi Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 663 | 2018 |
Towards vqa models that can read A Singh, V Natarajan, M Shah, Y Jiang, X Chen, D Batra, D Parikh, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 642 | 2019 |
icoseg: Interactive co-segmentation with intelligent scribble guidance D Batra, A Kowdle, D Parikh, J Luo, T Chen 2010 IEEE computer society conference on computer vision and pattern …, 2010 | 621 | 2010 |
Neural baby talk J Lu, J Yang, D Batra, D Parikh Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 554 | 2018 |
Counterfactual visual explanations Y Goyal, Z Wu, J Ernst, D Batra, D Parikh, S Lee International Conference on Machine Learning, 2376-2384, 2019 | 544 | 2019 |