Augmented language models: a survey G Mialon, R Dessì, M Lomeli, C Nalmpantis, R Pasunuru, R Raileanu, ... TMLR, 2023 | 375 | 2023 |
GraphiT: Encoding graph structure in transformers G Mialon, D Chen, M Selosse, J Mairal arXiv preprint arXiv:2106.05667, 2021 | 133 | 2021 |
A kernel perspective for regularizing deep neural networks A Bietti, G Mialon, D Chen, J Mairal ICML 2019, 664-674, 2019 | 95* | 2019 |
A cookbook of self-supervised learning R Balestriero, M Ibrahim, V Sobal, A Morcos, S Shekhar, T Goldstein, ..., Avi Schwarzschild, Andrew Gordon Wilson, Jonas Geiping, Quentin Garrido, Pierre Fernandez, Amir Bar, Hamed Pirsiavash, Yann LeCun, Micah Goldblum 2, 2023 | 64* | 2023 |
A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention G Mialon, D Chen, A d'Aspremont, J Mairal ICLR 2021, 2020 | 62 | 2020 |
GAIA: a benchmark for General AI Assistants G Mialon, C Fourrier, C Swift, T Wolf, Y LeCun, T Scialom ICLR 2024, 2023 | 42 | 2023 |
Self-supervised learning with Lie symmetries for partial differential equations G Mialon, Q Garrido, H Lawrence, D Rehman, Y LeCun, B Kiani Advances in Neural Information Processing Systems 36, 28973-29004, 2023 | 9* | 2023 |
Variance covariance regularization enforces pairwise independence in self-supervised representations G Mialon, R Balestriero, Y LeCun | 9 | 2022 |
A Cookbook of Self-Supervised Learning (2023) R Balestriero, M Ibrahim, V Sobal, A Morcos, S Shekhar, T Goldstein, ... arXiv preprint arXiv:2304.12210, 2023 | 7 | 2023 |
WorldSense: A synthetic benchmark for grounded reasoning in large language models Y Benchekroun, M Dervishi, M Ibrahim, JB Gaya, X Martinet, G Mialon, ... arXiv preprint arXiv:2311.15930, 2023 | 6 | 2023 |
Screening data points in empirical risk minimization via ellipsoidal regions and safe loss functions G Mialon, J Mairal, A d’Aspremont International Conference on Artificial Intelligence and Statistics, 3610-3620, 2020 | 1 | 2020 |
On Inductive Biases for Machine Learning in Data Constrained Settings (PhD thesis) G Mialon arXiv preprint arXiv:2302.10692, 2022 | | 2022 |