关注
Mustafa Shukor
Mustafa Shukor
PhD Student at Sorbonne University
在 sorbonne-universite.fr 的电子邮件经过验证
标题
引用次数
引用次数
年份
Rewarded soups: towards Pareto-optimal alignment by interpolating weights fine-tuned on diverse rewards
A Rame, G Couairon, M Shukor, C Dancette, JB Gaya, L Soulier, M Cord
NeurIPS 2023, 2023
582023
UnIVAL: Unified Model for Image, Video, Audio and Language Tasks
M Shukor, C Dancette, A Rame, M Cord
Transactions on Machine Learning Research (TMLR), 2023, 2023
20*2023
Transformer decoders with multimodal regularization for cross-modal food retrieval
M Shukor, G Couairon, A Grechka, M Cord
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
182022
Synthetic training data generation for deep learning based quality inspection
P Gutierrez, M Luschkova, A Cordier, M Shukor, M Schappert, T Dahmen
International Conference on Quality Control by Artificial Vision (QCAV 2021 …, 2021
182021
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical Alignment
M Shukor, G Couairon, M Cord
BMVC 2022, 2022
162022
eP-ALM: Efficient Perceptual Augmentation of Language Models
M Shukor, C Dancette, M Cord
ICCV 2023, 2023
142023
Beyond task performance: Evaluating and reducing the flaws of large multimodal models with in-context learning
M Shukor, A Rame, C Dancette, M Cord
ICLR 2024, 2023
102023
Semantic unfolding of stylegan latent space
M Shukor, X Yao, BB Damodaran, P Hellier
2022 IEEE International Conference on Image Processing (ICIP), 221-225, 2022
10*2022
Vision and structured-language pretraining for cross-modal food retrieval
M Shukor, N Thome, M Cord
Computer Vision and Image Understanding 247, 104071, 2024
8*2024
Video coding using learned latent gan compression
M Shukor, BB Damodaran, X Yao, P Hellier
Proceedings of the 30th ACM International Conference on Multimedia, 2239-2248, 2022
52022
Improved baselines for data-efficient perceptual augmentation of llms
T Vallaeys, M Shukor, M Cord, J Verbeek
arXiv preprint arXiv:2403.13499, 2024
42024
What Makes Multimodal In-Context Learning Work?
FB Baldassini, M Shukor, M Cord, L Soulier, B Piwowarski
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
42024
A Concept-Based Explainability Framework for Large Multimodal Models
J Parekh, P Khayatan, M Shukor, A Newson, M Cord
arXiv preprint arXiv:2406.08074, 2024
2024
Zero-Shot Image Segmentation via Recursive Normalized Cut on Diffusion Features
P Couairon, M Shukor, JE Haugeard, M Cord, N Thome
arXiv preprint arXiv:2406.02842, 2024
2024
Implicit Multimodal Alignment: On the Generalization of Frozen LLMs to Multimodal Inputs
M Shukor, M Cord
arXiv preprint arXiv:2405.16700, 2024
2024
FreeSeg-Diff: Training-Free Open-Vocabulary Segmentation with Diffusion Models
BT Corradini, M Shukor, P Couairon, G Couairon, F Scarselli, M Cord
arXiv preprint arXiv:2403.20105, 2024
2024
Supplementary material for eP-ALM: Efficient Perceptual Augmentation of Language Models
M Shukor, C Dancette, M Cord
系统目前无法执行此操作,请稍后再试。
文章 1–17