InternLM: A multilingual language model with progressively enhanced capabilities ILM Team (2023-01-06)[2023-09-27]. https://github.com/InternLM/InternLM, 2023 | 147 | 2023 |
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, ... arXiv preprint arXiv:2309.15112, 2023 | 100 | 2023 |
Boundary perception guidance: A scribble-supervised semantic segmentation approach B Wang, G Qi, S Tang, T Zhang, Y Wei, L Li, Y Zhang IJCAI International joint conference on artificial intelligence, 2019 | 92 | 2019 |
InternLM-XComposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024 | 82 | 2024 |
InternLM2 Technical Report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 58 | 2024 |
Automated pulmonary nodule detection: High sensitivity with few candidates B Wang, G Qi, S Tang, L Zhang, L Deng, Y Zhang International Conference on Medical Image Computing and Computer-Assisted …, 2018 | 52 | 2018 |
How Far Are We to GPT-4V? Closing the Gap to Commercial Multimodal Models with Open-Source Suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024 | 51 | 2024 |
OPERA: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 38 | 2024 |
V3Det: Vast vocabulary visual detection dataset J Wang, P Zhang, T Chu, Y Cao, Y Zhou, T Wu, B Wang, C He, D Lin International Conference on Computer Vision (ICCV), 19844-19854, 2023 | 36 | 2023 |
InternLM-XComposer2-4KHD: A Pioneering Large Vision-Language Model Handling Resolutions from 336 Pixels to 4K HD X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, S Zhang, H Duan, ... arXiv preprint arXiv:2404.06512, 2024 | 35 | 2024 |
Beyond hallucinations: Enhancing LVLMs through hallucination-aware direct preference optimization Z Zhao, B Wang, L Ouyang, X Dong, J Wang, C He arXiv preprint arXiv:2311.16839, 2023 | 35 | 2023 |
VIGC: Visual instruction generation and correction B Wang, F Wu, X Han, J Peng, H Zhong, P Zhang, X Dong, W Li, W Li, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 5309-5317, 2024 | 33 | 2024 |
WanJuan: A comprehensive multimodal dataset for advancing English and Chinese large models C He, Z Jin, C Xu, J Qiu, B Wang, W Li, H Yan, J Wang, D Lin arXiv preprint arXiv:2308.10755, 2023 | 25 | 2023 |
Detection and tracking based tubelet generation for video object detection B Wang, S Tang, JB Xiao, QF Yan, YD Zhang Journal of Visual Communication and Image Representation 58, 102-111, 2019 | 16 | 2019 |
Unified interactive image matting SDH Yang, B Wang, W Li, YQ Lin, C He arXiv preprint arXiv:2205.08324, 2022 | 11 | 2022 |
Pedestrian detection based on region proposal fusion B Wang, S Tang, R Zhao, W Liu, Y Cen 2015 IEEE 17th International Workshop on Multimedia Signal Processing (MMSP …, 2015 | 11 | 2015 |
OpenDataLab: Empowering general artificial intelligence with open datasets C He, W Li, Z Jin, C Xu, B Wang, D Lin arXiv preprint arXiv:2407.13773, 2024 | 8 | 2024 |
Spatiotemporal breast mass detection network (MD-Net) in 4D DCE-MRI images L Deng, S Tang, H Fu, B Wang, Y Zhang International Conference on Medical Image Computing and Computer-Assisted …, 2019 | 7 | 2019 |
Parrot Captions Teach CLIP to Spot Text Y Lin, C He, AJ Wang, B Wang, W Li, MZ Shou arXiv preprint arXiv:2312.14232, 2023 | 4 | 2023 |
Cycle-consistent learning for weakly supervised semantic segmentation B Wang, Y Qiao, D Lin, SDH Yang, W Li Proceedings of the 3rd International Workshop on Human-Centric Multimedia …, 2022 | 4 | 2022 |