Supervision exists everywhere: A data efficient contrastive language-image pre-training paradigm Y Li*, F Liang*, L Zhao*, Y Cui, W Ouyang, J Shao, F Yu, J Yan International Conference on Learning Representations(ICLR) 2022, 2021 | 440 | 2021 |
Emu: Generative Pretraining in Multimodality Q Sun*, Q Yu*, Y Cui*, F Zhang*, X Zhang*, Y Wang, H Gao, J Liu, ... The Twelfth International Conference on Learning Representations, 2023 | 182* | 2023 |
Emu2: Generative multimodal models are in-context learners Q Sun*, Y Cui*, X Zhang*, F Zhang*, Q Yu*, Z Luo, Y Wang, Y Rao, J Liu, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 145* | 2023 |
Democratizing contrastive language-image pre-training: A clip benchmark of data, model, and supervision Y Cui, L Zhao, F Liang, Y Li, J Shao ICML First Workshop on Pre-training 2022, 2022 | 37 | 2022 |
Capsfusion: Rethinking image-text data at scale Q Yu, Q Sun, X Zhang, Y Cui, F Zhang, Y Cao, X Wang, J Liu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 35 | 2024 |
Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline Y Li, B Huang, Z Chen, Y Cui, F Liang, M Shen, F Liu, E Xie, L Sheng, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | 34 | 2023 |
Multi-modal gait recognition via effective spatial-temporal feature fusion Y Cui, Y Kang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 31 | 2023 |
Eva-clip-18b: Scaling clip to 18 billion parameters Q Sun, J Wang, Q Yu, Y Cui, F Zhang, X Zhang, X Wang arXiv preprint arXiv:2402.04252, 2024 | 21 | 2024 |
Emu3: Next-token prediction is all you need X Wang, X Zhang, Z Luo, Q Sun, Y Cui, J Wang, F Zhang, Y Wang, Z Li, ... arXiv preprint arXiv:2409.18869, 2024 | 12 | 2024 |
GaitTransformer: Multiple-temporal-scale transformer for cross-view gait recognition Y Cui, Y Kang 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | 11 | 2022 |
Unveiling Encoder-Free Vision-Language Models H Diao*, Y Cui*, X Li, Y Wang, H Lu, X Wang arXiv preprint arXiv:2406.11832, 2024 | 4 | 2024 |
Learning Multiple Granularity Features for Unsupervised Person Re-Identification S Wang*, Y Cui*, Y Kang 2022 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2022 | | 2022 |