An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale A Dosovitskiy*, L Beyer*, A Kolesnikov*, D Weissenborn*, X Zhai*, ... ICLR 2021, 2020 | 38959 | 2020 |
In Defense of the Triplet Loss for Person Re-Identification A Hermans*, L Beyer*, B Leibe, *equal contribution arXiv preprint arXiv:1703.07737, 2017 | 3736 | 2017 |
MLP-Mixer: An all-MLP architecture for vision I Tolstikhin*, N Houlsby*, A Kolesnikov*, L Beyer*, X Zhai, T Unterthiner, ... NeurIPS 2021, 2021 | 2366 | 2021 |
Big Transfer (BiT): General Visual Representation Learning A Kolesnikov*, L Beyer*, X Zhai*, J Puigcerver, J Yung, S Gelly, ... ECCV 2020, 2019 | 1320 | 2019 |
S4L: Self-Supervised Semi-Supervised Learning X Zhai*, A Oliver*, A Kolesnikov*, L Beyer*, *equal contribution ICCV 2019, 2019 | 989* | 2019 |
Scaling Vision Transformers X Zhai*, A Kolesnikov*, N Houlsby, L Beyer*, *equal contribution CVPR 2022, 2021 | 966 | 2021 |
Revisiting Self-Supervised Visual Representation Learning A Kolesnikov*, X Zhai*, L Beyer*, *equal contribution CVPR 2019, 2019 | 836 | 2019 |
How to train your ViT? Data, Augmentation, and Regularization in Vision Transformers A Steiner*, A Kolesnikov*, X Zhai*, R Wightman, J Uszkoreit, L Beyer*, ... TMLR, 2021 | 544 | 2021 |
PaLI: A jointly-scaled multilingual language-image model X Chen, X Wang, S Changpinyo, AJ Piergiovanni, P Padlewski, D Salz, ... ICLR 2023, 2022 | 469 | 2022 |
LiT: Zero-Shot Transfer with Locked-image Text Tuning X Zhai*+, X Wang*, B Mustafa*, A Steiner*, D Keysers, A Kolesnikov, ... CVPR 2022, 2021 | 443 | 2021 |
A large-scale study of representation learning with the visual task adaptation benchmark X Zhai, J Puigcerver, A Kolesnikov, P Ruyssen, C Riquelme, M Lucic, ... arXiv preprint arXiv:1910.04867, 2019 | 359* | 2019 |
Are we done with ImageNet? L Beyer*, OJ Hénaff*, A Kolesnikov*, X Zhai*, A Oord*, *equal contribution arXiv preprint arXiv:2006.07159, 2020 | 349 | 2020 |
Scaling vision transformers to 22 billion parameters M Dehghani, J Djolonga, B Mustafa, P Padlewski, J Heek, J Gilmer, ... International Conference on Machine Learning, 7480-7512, 2023 | 329 | 2023 |
SPENCER: A Socially Aware Service Robot for Passenger Guidance and Help in Busy Airports L Palmieri, U Rafi, M van Rooij, B Okal, M Magnusson, T Linder, M Lohse, ... Proceedings of the 10th Conference on Field and Service Robotics, FSR 2015 …, 2015 | 308* | 2015 |
The STRANDS project: Long-term autonomy in everyday environments N Hawes, C Burbridge, F Jovan, L Kunze, B Lacerda, L Mudrova, J Young, ... IEEE Robotics & Automation Magazine 24 (3), 146-156, 2017 | 257 | 2017 |
Knowledge distillation: A good teacher is patient and consistent L Beyer*, X Zhai*, A Royer, L Markeeva, R Anil, A Kolesnikov*, ... CVPR 2022, 2021 | 255 | 2021 |
A Simple Single-Scale Vision Transformer for Object Detection and Instance Segmentation W Chen, X Du, F Yang, L Beyer, X Zhai, TY Lin, H Chen, J Li, X Song, ... ECCV 2022, 711-727, 2022 | 196* | 2022 |
Sigmoid loss for language image pre-training X Zhai, B Mustafa, A Kolesnikov, L Beyer Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 165 | 2023 |
Kubric: A scalable dataset generator K Greff, F Belletti, L Beyer, C Doersch, Y Du, D Duckworth, DJ Fleet, ... CVPR 2022, 3749-3761, 2022 | 148 | 2022 |
On Robustness and Transferability of Convolutional Neural Networks J Djolonga, J Yung, M Tschannen, R Romijnders, L Beyer, A Kolesnikov, ... CVPR 2021, 2020 | 136 | 2020 |