Enriching word vectors with subword information P Bojanowski, E Grave, A Joulin, T Mikolov Transactions of the Association for Computational Linguistics 5, 135--146, 2017 | 12755 | 2017 |
Llama: Open and efficient foundation language models H Touvron, T Lavril, G Izacard, X Martinet, MA Lachaux, T Lacroix, ... arXiv preprint arXiv:2302.13971, 2023 | 7816 | 2023 |
Bag of tricks for efficient text classification A Joulin, E Grave, P Bojanowski, T Mikolov Proceedings of the 15th Conference of the European Chapter of the …, 2017 | 6189 | 2017 |
Emerging properties in self-supervised vision transformers M Caron, H Touvron, I Misra, H Jégou, J Mairal, P Bojanowski, A Joulin Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 4596 | 2021 |
Unsupervised learning of visual features by contrasting cluster assignments M Caron, I Misra, J Mairal, P Goyal, P Bojanowski, A Joulin Advances in neural information processing systems 33, 9912-9924, 2020 | 3850 | 2020 |
Deep clustering for unsupervised learning of visual features M Caron, P Bojanowski, A Joulin, M Douze Proceedings of the European conference on computer vision (ECCV), 132-149, 2018 | 3157 | 2018 |
Learning word vectors for 157 languages E Grave, P Bojanowski, P Gupta, A Joulin, T Mikolov Proceedings of Language Resources and Evaluation Conference (LREC), 2018 | 1857 | 2018 |
Advances in pre-training distributed word representations T Mikolov, E Grave, P Bojanowski, C Puhrsch, A Joulin Proceedings of Language Resources and Evaluation Conference (LREC), 2018 | 1764 | 2018 |
FastText.zip: Compressing text classification models A Joulin, E Grave, P Bojanowski, M Douze, H Jégou, T Mikolov arXiv preprint arXiv:1612.03651, 2016 | 1637 | 2016 |
Dinov2: Learning robust visual features without supervision M Oquab, T Darcet, T Moutakanni, H Vo, M Szafraniec, V Khalidov, ... arXiv preprint arXiv:2304.07193, 2023 | 1393* | 2023 |
Towards ai-complete question answering: A set of prerequisite toy tasks J Weston, A Bordes, S Chopra, AM Rush, B Van Merriënboer, A Joulin, ... arXiv preprint arXiv:1502.05698, 2015 | 1297 | 2015 |
Deep fragment embeddings for bidirectional image sentence mapping A Karpathy, A Joulin, L Fei Fei Advances in neural information processing systems, 1889-1897, 2014 | 1101 | 2014 |
Levit: a vision transformer in convnet's clothing for faster inference B Graham, A El-Nouby, H Touvron, P Stock, A Joulin, H Jégou, M Douze Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 731* | 2021 |
Beyond English-Centric Multilingual Machine Translation A Fan, S Bhosale, H Schwenk, Z Ma, A El-Kishky, S Goyal, M Baines, ... J. Mach. Learn. Res. 22 (107), 1-48, 2021 | 708 | 2021 |
Resmlp: Feedforward networks for image classification with data-efficient training H Touvron, P Bojanowski, M Caron, M Cord, A El-Nouby, E Grave, ... IEEE transactions on pattern analysis and machine intelligence 45 (4), 5314-5321, 2022 | 691* | 2022 |
Libri-Light: A Benchmark for ASR with Limited or No Supervision J Kahn, M Rivière, W Zheng, E Kharitonov, Q Xu, PE Mazaré, J Karadayi, ... Proceedings of the International Conference on Acoustics, Speech, and Signal …, 2020 | 593 | 2020 |
Discriminative clustering for image co-segmentation A Joulin, F Bach, J Ponce Proceedings of the Conference on Computer Vision and Pattern Recognition …, 2010 | 588 | 2010 |
Reducing Transformer Depth on Demand with Structured Dropout A Fan, E Grave, A Joulin International Conference on Learning Representations (ICLR), 2020 | 584 | 2020 |
CCNet: Extracting high quality monolingual datasets from web crawl data G Wenzek, MA Lachaux, A Conneau, V Chaudhary, F Guzman, A Joulin, ... Proceedings of Language Resources and Evaluation Conference (LREC), 2020 | 550 | 2020 |
Imagebind: One embedding space to bind them all G Rohi, A El-Nouby, Z Liu, M Singh, KV Alwala, A Joulin, I Misra Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 545* | 2023 |