关注
Weiyun Wang
Weiyun Wang
Shanghai AI Laboratory; Fudan University
在 pjlab.org.cn 的电子邮件经过验证
标题
引用次数
引用次数
年份
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language
Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ...
arXiv preprint arXiv:2305.05662, 2023
662023
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites
Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ...
arXiv preprint arXiv:2404.16821, 2024
542024
The all-seeing project: Towards panoptic visual recognition and understanding of the open world
W Wang, M Shi, Q Li, W Wang, Z Huang, L Xing, Z Chen, H Li, X Zhu, ...
The Twelfth International Conference on Learning Representations (ICLR 2024), 2023
402023
Mm-interleaved: Interleaved image-text generative modeling via multi-modal feature synchronizer
C Tian, X Zhu, Y Xiong, W Wang, Z Chen, W Wang, Y Chen, L Lu, T Lu, ...
arXiv preprint arXiv:2401.10208, 2024
212024
Vision-rwkv: Efficient and scalable visual perception with rwkv-like architectures
Y Duan, W Wang, Z Chen, X Zhu, L Lu, T Lu, Y Qiao, H Li, J Dai, W Wang
arXiv preprint arXiv:2403.02308, 2024
162024
The all-seeing project v2: Towards general relation comprehension of the open world
W Wang, Y Ren, H Luo, T Li, C Yan, Z Chen, W Wang, Q Li, L Lu, X Zhu, ...
The 18th European Conference on Computer Vision ECCV 2024, 2024
122024
Demystify transformers & convolutions in modern image deep networks
J Dai, M Shi, W Wang, S Wu, L Xing, W Wang, X Zhu, L Lu, J Zhou, ...
arXiv preprint arXiv:2211.05781, 2022
122022
Cliptext: A new paradigm for zero-shot text classification
L Qin, W Wang, Q Chen, W Che
Findings of the Association for Computational Linguistics: ACL 2023, 1077-1088, 2023
32023
MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Diversity
Y Liu, Y Cao, Z Gao, W Wang, Z Chen, W Wang, H Tian, L Lu, X Zhu, T Lu, ...
arXiv preprint arXiv:2407.15838, 2024
2024
OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text
Q Li, Z Chen, W Wang, W Wang, S Ye, Z Jin, G Chen, Y He, Z Gao, E Cui, ...
arXiv preprint arXiv:2406.08418, 2024
2024
Needle In A Multimodal Haystack
W Wang, S Zhang, Y Ren, Y Duan, T Li, S Liu, M Hu, Z Chen, K Zhang, ...
arXiv preprint arXiv:2406.07230, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–11