Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards A Rame, G Couairon, M Shukor, C Dancette, JB Gaya, L Soulier, M Cord NeurIPS 2023, 2023 | 58 | 2023 |
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks M Shukor, C Dancette, A Rame, M Cord Transactions on Machine Learning Research (TMLR), 2023, 2023 | 20* | 2023 |
Transformer decoders with multimodal regularization for cross-modal food retrieval M Shukor, G Couairon, A Grechka, M Cord Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 18 | 2022 |
Synthetic training data generation for deep learning based quality inspection P Gutierrez, M Luschkova, A Cordier, M Shukor, M Schappert, T Dahmen International Conference on Quality Control by Artificial Vision (QCAV 2021 …, 2021 | 18 | 2021 |
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment M Shukor, G Couairon, M Cord BMVC 2022, 2022 | 16 | 2022 |
eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord ICCV 2023, 2023 | 14 | 2023 |
Beyond task performance: Evaluating and reducing the flaws of large multimodal models with in-context learning M Shukor, A Rame, C Dancette, M Cord ICLR 2024, 2023 | 10 | 2023 |
Semantic unfolding of StyleGAN latent space M Shukor, X Yao, BB Damodaran, P Hellier 2022 IEEE International Conference on Image Processing (ICIP), 221-225, 2022 | 10* | 2022 |
Vision and structured-language pretraining for cross-modal food retrieval M Shukor, N Thome, M Cord Computer Vision and Image Understanding 247, 104071, 2024 | 8* | 2024 |
Video coding using learned latent GAN compression M Shukor, BB Damodaran, X Yao, P Hellier Proceedings of the 30th ACM International Conference on Multimedia, 2239-2248, 2022 | 5 | 2022 |
Improved baselines for data-efficient perceptual augmentation of LLMs T Vallaeys, M Shukor, M Cord, J Verbeek arXiv preprint arXiv:2403.13499, 2024 | 4 | 2024 |
What Makes Multimodal In-Context Learning Work? FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 4 | 2024 |
A Concept-Based Explainability Framework for Large Multimodal Models J Parekh, P Khayatan, M Shukor, A Newson, M Cord arXiv preprint arXiv:2406.08074, 2024 | | 2024 |
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features P Couairon, M Shukor, JE Haugeard, M Cord, N Thome arXiv preprint arXiv:2406.02842, 2024 | | 2024 |
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs M Shukor, M Cord arXiv preprint arXiv:2405.16700, 2024 | | 2024 |
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord arXiv preprint arXiv:2403.20105, 2024 | | 2024 |
Supplementary material for eP-ALM: Efficient Perceptual Augmentation of Language Models M Shukor, C Dancette, M Cord | | |