Llama-adapter v2: Parameter-efficient visual instruction model P Gao, J Han, R Zhang, Z Lin, S Geng, A Zhou, W Zhang, P Lu, C He, ... arXiv preprint arXiv:2304.15010, 2023 | 346 | 2023 |
Mmbench: Is your multi-modal model an all-around player? Y Liu, H Duan, Y Zhang, B Li, S Zhang, W Zhao, Y Yuan, J Wang, C He, ... arXiv preprint arXiv:2307.06281, 2023 | 287 | 2023 |
Semantic segmentation-based building footprint extraction using very high-resolution satellite images and multi-source GIS data W Li, C He, J Fang, J Zheng, H Fu, L Yu Remote Sensing 11 (4), 403, 2019 | 224 | 2019 |
Sharegpt4v: Improving large multi-modal models with better captions L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin arXiv preprint arXiv:2311.12793, 2023 | 146 | 2023 |
9-Pflops nonlinear earthquake simulation on Sunway TaihuLight: enabling depiction of 18-Hz and 8-meter scenarios H Fu, C He, B Chen, Z Yin, Z Zhang, W Zhang, T Zhang, W Xue, W Liu, ... Proceedings of the International Conference for High Performance Computing …, 2017 | 138 | 2017 |
Persformer: 3d lane detection via perspective transformer and the openlane benchmark L Chen, C Sima, Y Li, Z Zheng, J Xu, X Geng, H Li, C He, J Shi, Y Qiao, ... European Conference on Computer Vision, 550-567, 2022 | 119 | 2022 |
Internlm-xcomposer: A vision-language large model for advanced text-image comprehension and composition P Zhang, XDB Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, S Zhang, ... arXiv preprint arXiv:2309.15112, 2023 | 91 | 2023 |
Internvid: A large-scale video-text dataset for multimodal understanding and generation Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ... arXiv preprint arXiv:2307.06942, 2023 | 85 | 2023 |
Influence selection for active learning Z Liu, H Ding, H Zhong, W Li, J Dai, C He Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 80 | 2021 |
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024 | 68 | 2024 |
Global-scale associations of vegetation phenology with rainfall and temperature at a high spatio-temporal resolution N Clinton, L Yu, H Fu, C He, P Gong Remote Sensing 6 (8), 7320-7338, 2014 | 49 | 2014 |
Semantic segmentation based building extraction method using multi-source gis map datasets and satellite imagery W Li, C He, J Fang, H Fu Proceedings of the IEEE conference on computer vision and pattern …, 2018 | 48 | 2018 |
Think twice before driving: Towards scalable decoders for end-to-end autonomous driving X Jia, P Wu, L Chen, J Xie, C He, J Yan, H Li Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 45 | 2023 |
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024 | 44 | 2024 |
Refactoring and optimizing the community atmosphere model (CAM) on the sunway taihulight supercomputer H Fu, J Liao, W Xue, L Wang, D Chen, L Gu, J Xu, N Ding, X Wang, C He, ... SC'16: Proceedings of the International Conference for High Performance …, 2016 | 41 | 2016 |
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024 | 39 | 2024 |
Joint semantic-geometric learning for polygonal building segmentation W Li, W Zhao, H Zhong, C He, D Lin Proceedings of the AAAI Conference on Artificial Intelligence 35 (3), 1958-1965, 2021 | 35 | 2021 |
Opera: Alleviating hallucination in multi-modal large language models via over-trust penalty and retrospection-allocation Q Huang, X Dong, P Zhang, B Wang, C He, J Wang, D Lin, W Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 34 | 2024 |
V3det: Vast vocabulary visual detection dataset J Wang, P Zhang, T Chu, Y Cao, Y Zhou, T Wu, B Wang, C He, D Lin Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 34 | 2023 |
swcaffe: A parallel framework for accelerating deep learning applications on sunway taihulight L Li, J Fang, H Fu, J Jiang, W Zhao, C He, X You, G Yang 2018 IEEE International Conference on Cluster Computing (CLUSTER), 413-422, 2018 | 34 | 2018 |