Llama 2: Open foundation and fine-tuned chat models H Touvron, L Martin, K Stone, P Albert, A Almahairi, Y Babaei, ... arXiv preprint arXiv:2307.09288, 2023 | 5962 | 2023 |
Flava: A foundational language and vision alignment model A Singh*, R Hu*, V Goswami*, G Couairon, W Galuba, M Rohrbach, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 519 | 2022 |
12-in-1: Multi-task vision and language representation learning J Lu*, V Goswami*, M Rohrbach, D Parikh, S Lee Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 514 | 2020 |
No language left behind: Scaling human-centered machine translation MR Costa-jussà, J Cross, O Çelebi, M Elbayad, K Heafield, K Heffernan, ... arXiv preprint arXiv:2207.04672, 2022 | 489 | 2022 |
The hateful memes challenge: Detecting hate speech in multimodal memes D Kiela, H Firooz, A Mohan, V Goswami, A Singh, P Ringshia, ... Advances in neural information processing systems 33, 2611-2624, 2020 | 483 | 2020 |
MMF: A multimodal framework for vision and language research A Singh, V Goswami, V Natarajan, Y Jiang, X Chen, M Shah, M Rohrbach, ... URL: https://github. com/facebookresearch/mmf, 0 | 355* | |
Only time can tell: Discovering temporal data for temporal modeling L Sevilla-Lara, S Zha, Z Yan, V Goswami, M Feiszli, L Torresani Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021 | 79 | 2021 |
Creative sketch generation S Ge, V Goswami, CL Zitnick, D Parikh arXiv preprint arXiv:2011.10039, 2020 | 67 | 2020 |
The hateful memes challenge: Competition report D Kiela, H Firooz, A Mohan, V Goswami, A Singh, CA Fitzpatrick, P Bull, ... NeurIPS 2020 Competition and Demonstration Track, 344-360, 2021 | 59 | 2021 |
Human-adversarial visual question answering S Sheng, A Singh, V Goswami, J Magana, T Thrush, W Galuba, D Parikh, ... Advances in Neural Information Processing Systems 34, 20346-20359, 2021 | 56 | 2021 |
Are we pretraining it right? digging deeper into visio-linguistic pretraining A Singh, V Goswami, D Parikh arXiv preprint arXiv:2004.08744, 2020 | 47 | 2020 |
Movie: Revisiting modulated convolutions for visual counting and beyond DK Nguyen, V Goswami, X Chen arXiv preprint arXiv:2004.11883, 2020 | 33 | 2020 |
Speechmatrix: A large-scale mined corpus of multilingual speech-to-speech translations PA Duquenne, H Gong, N Dong, J Du, A Lee, V Goswani, C Wang, J Pino, ... arXiv preprint arXiv:2211.04508, 2022 | 22 | 2022 |
Tricks for training sparse translation models D Dua, S Bhosale, V Goswami, J Cross, M Lewis, A Fan arXiv preprint arXiv:2110.08246, 2021 | 21 | 2021 |
Muavic: A multilingual audio-visual corpus for robust speech recognition and robust speech-to-text translation M Anwar, B Shi, V Goswami, WN Hsu, J Pino, C Wang arXiv preprint arXiv:2303.00628, 2023 | 19 | 2023 |
Knowledge extraction and annotation for cross-domain textual case-based reasoning in biologically inspired design S Rugaber, S Bhati, V Goswami, E Spiliopoulou, S Azad, S Koushik, ... Case-Based Reasoning Research and Development: 24th International Conference …, 2016 | 15 | 2016 |
Causes and cures for interference in multilingual translation U Shaham, M Elbayad, V Goswami, O Levy, S Bhosale arXiv preprint arXiv:2212.07530, 2022 | 13 | 2022 |
Revisiting machine translation for cross-lingual classification M Artetxe, V Goswami, S Bhosale, A Fan, L Zettlemoyer arXiv preprint arXiv:2305.14240, 2023 | 12 | 2023 |
Unsupervised image-to-video clothing transfer A Pumarola, V Goswami, F Vicente, F De la Torre, F Moreno-Noguer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019 | 11 | 2019 |
Small data, big impact: Leveraging minimal data for effective machine translation J Maillard, C Gao, E Kalbassi, KR Sadagopan, V Goswami, P Koehn, ... Proceedings of the 61st Annual Meeting of the Association for Computational …, 2023 | 10 | 2023 |