InternGPT: Solving vision-centric tasks by interacting with ChatGPT beyond language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ... arXiv preprint arXiv:2305.05662, 2023 | 66 | 2023 |
How far are we to GPT-4V? Closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024 | 54 | 2024 |
The all-seeing project: Towards panoptic visual recognition and understanding of the open world W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing, Z Chen, H Li, X Zhu, ... The Twelfth International Conference on Learning Representations (ICLR 2024), 2023 | 40 | 2023 |
MM-Interleaved: Interleaved image-text generative modeling via multi-modal feature synchronizer C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ... arXiv preprint arXiv:2401.10208, 2024 | 21 | 2024 |
Vision-RWKV: Efficient and scalable visual perception with RWKV-like architectures Y Duan, W Wang, Z Chen, X Zhu, L Lu, T Lu, Y Qiao, H Li, J Dai, W Wang arXiv preprint arXiv:2403.02308, 2024 | 16 | 2024 |
The all-seeing project v2: Towards general relation comprehension of the open world W Wang, Y Ren, H Luo, T Li, C Yan, Z Chen, W Wang, Q Li, L Lu, X Zhu, ... The 18th European Conference on Computer Vision (ECCV 2024), 2024 | 12 | 2024 |
Demystify transformers & convolutions in modern image deep networks J Dai, M Shi, W Wang, S Wu, L Xing, W Wang, X Zhu, L Lu, J Zhou, ... arXiv preprint arXiv:2211.05781, 2022 | 12 | 2022 |
CLIPText: A new paradigm for zero-shot text classification L Qin, W Wang, Q Chen, W Che Findings of the Association for Computational Linguistics: ACL 2023, 1077-1088, 2023 | 3 | 2023 |
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity Y Liu, Y Cao, Z Gao, W Wang, Z Chen, W Wang, H Tian, L Lu, X Zhu, T Lu, ... arXiv preprint arXiv:2407.15838, 2024 | | 2024 |
OmniCorpus: A Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Q Li, Z Chen, W Wang, W Wang, S Ye, Z Jin, G Chen, Y He, Z Gao, E Cui, ... arXiv preprint arXiv:2406.08418, 2024 | | 2024 |
Needle In A Multimodal Haystack W Wang, S Zhang, Y Ren, Y Duan, T Li, S Liu, M Hu, Z Chen, K Zhang, ... arXiv preprint arXiv:2406.07230, 2024 | | 2024 |