You only look once: Unified, real-time object detection J Redmon, S Divvala, R Girshick, A Farhadi CVPR, 2016 | 49154 | 2016 |
Yolov3: An incremental improvement J Redmon, A Farhadi arXiv preprint arXiv:1804.02767, 2018 | 28362 | 2018 |
YOLO9000: Better, Faster, Stronger J Redmon, A Farhadi CVPR, 2017 | 22259 | 2017 |
XNOR-Net: ImageNet Classification Using Binary Convolutional Neural Networks M Rastegari, V Ordonez, J Redmon, A Farhadi ECCV, 2016 | 5520 | 2016 |
Unsupervised Deep Embedding for Clustering Analysis J Xie, R Girshick, A Farhadi ICML, 2016 | 3359 | 2016 |
Describing objects by their attributes A Farhadi, I Endres, D Hoiem, D Forsyth Computer Vision and Pattern Recognition, 2009. CVPR 2009. IEEE Conference on …, 2009 | 2467 | 2009 |
Bidirectional Attention Flow for Machine Comprehension M Seo, A Kembhavi, A Farhadi, H Hajishirzi ICLR, 2017 | 2322 | 2017 |
Target-driven visual navigation in indoor scenes using deep reinforcement learning Y Zhu, R Mottaghi, E Kolve, JJ Lim, A Gupta, L Fei-Fei, A Farhadi ICRA, 2017 | 1811 | 2017 |
Every picture tells a story: Generating sentences from images A Farhadi, M Hejrati, MA Sadeghi, P Young, C Rashtchian, ... Computer Vision–ECCV 2010: 11th European Conference on Computer Vision …, 2010 | 1565 | 2010 |
Hollywood in Homes: Crowdsourcing Data Collection for Activity Understanding GA Sigurdsson, G Varol, X Wang, A Farhadi, I Laptev, A Gupta ECCV, 2016 | 1346 | 2016 |
Yolov3: an incremental improvement (2018) J Redmon, A Farhadi arXiv preprint arXiv:1804.02767 20, 1804 | 1223 | 1804 |
Hellaswag: Can a machine really finish your sentence? R Zellers, A Holtzman, Y Bisk, A Farhadi, Y Choi arXiv preprint arXiv:1905.07830, 2019 | 1096 | 2019 |
Defending against neural fake news R Zellers, A Holtzman, H Rashkin, Y Bisk, A Farhadi, F Roesner, Y Choi Advances in neural information processing systems 32, 2019 | 1002 | 2019 |
From recognition to cognition: Visual commonsense reasoning R Zellers, Y Bisk, A Farhadi, Y Choi Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2019 | 872 | 2019 |
Ai2-thor: An interactive 3d environment for visual ai E Kolve, R Mottaghi, W Han, E VanderBilt, L Weihs, A Herrasti, M Deitke, ... arXiv preprint arXiv:1712.05474, 2017 | 830 | 2017 |
Yolov3: An incremental improvement A Farhadi, J Redmon Computer vision and pattern recognition 1804, 1-6, 2018 | 751 | 2018 |
Ok-vqa: A visual question answering benchmark requiring external knowledge K Marino, M Rastegari, A Farhadi, R Mottaghi Proceedings of the IEEE/cvf conference on computer vision and pattern …, 2019 | 731 | 2019 |
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time M Wortsman, G Ilharco, SY Gadre, R Roelofs, R Gontijo-Lopes, ... International conference on machine learning, 23965-23998, 2022 | 608 | 2022 |
Fine-tuning pretrained language models: Weight initializations, data orders, and early stopping J Dodge, G Ilharco, R Schwartz, A Farhadi, H Hajishirzi, N Smith arXiv preprint arXiv:2002.06305, 2020 | 562 | 2020 |
Recognition using visual phrases MA Sadeghi, A Farhadi CVPR 2011, 1745-1752, 2011 | 550 | 2011 |