OmniCorpus: An Unified Multimodal Corpus of 10 Billion-Level Images Interleaved with Text Q Li, Z Chen, W Wang, W Wang, S Ye, Z Jin, G Chen, Y He, Z Gao, E Cui, ... arXiv preprint arXiv:2406.08418, 2024 | | 2024 |
ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation B Zhang, X Cai, J Yuan, D Yang, J Guo, R Xia, B Shi, M Dou, T Chen, ... International Conference on Learning Representations (ICLR), 2024 | 4 | 2024 |
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024 | 18 | 2024 |
UniMERNet: A Universal Network for Real-World Mathematical Expression Recognition B Wang, Z Gu, C Xu, B Zhang, B Shi, C He arXiv preprint arXiv:2404.15254, 2024 | 1 | 2024 |
Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression H Ye, C Yu, P Ye, R Xia, Y Tang, J Lu, T Chen, B Zhang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024 | 1 | 2024 |
Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models X Lu, Q Liu, Y Xu, A Zhou, S Huang, B Zhang, J Yan, H Li Annual Meeting of the Association for Computational Linguistics (ACL), 2024 | 1 | 2024 |
ChartX & ChartVLM: A Versatile Benchmark and Foundation Model for Complicated Chart Reasoning R Xia, B Zhang, H Ye, X Yan, Q Liu, H Zhou, Z Chen, M Dou, B Shi, J Yan, ... arXiv preprint arXiv:2402.12185, 2024 | 3 | 2024 |
AD-PT: Autonomous Driving Pre-Training with Large-scale Point Cloud Dataset J Yuan, B Zhang, X Yan, B Shi, T Chen, Y Li, Y Qiao Neural Information Processing Systems (NeurIPS), 2024 | 12 | 2024 |
Cross-Task Linearity Emerges in the Pretraining-Finetuning Paradigm Z Zhou, Z Chen, Y Chen, B Zhang, J Yan International Conference on Machine Learning (ICML), 2024 | | 2024 |
Towards Knowledge-driven Autonomous Driving X Li, Y Bai, P Cai, L Wen, D Fu, B Zhang, X Yang, X Cai, T Ma, J Guo, ... arXiv preprint arXiv:2312.04316, 2023 | 13 | 2023 |
SUG: Single-dataset Unified Generalization for 3D Point Cloud Classification S Huang, B Zhang, B Shi, H Li, Y Li, P Gao ACM International Conference on Multimedia (ACM MM), 2023 | 5 | 2023 |
StructChart: Perception, Structuring, Reasoning for Visual Chart Understanding R Xia, B Zhang, H Peng, N Liao, P Ye, B Shi, J Yan, Y Qiao arXiv preprint arXiv:2309.11268, 2023 | 8 | 2023 |
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving X Yan, R Chen, B Zhang, J Yuan, X Cai, B Shi, W Shao, J Yan, P Luo, ... arXiv preprint arXiv:2309.10527, 2023 | 5 | 2023 |
Rethinking cross-domain pedestrian detection: a background-focused distribution alignment framework for instance-free one-stage detectors Y Cai, B Zhang, B Li, T Chen, H Yan, J Zhang IEEE Transactions on Image Processing (TIP), 2023 | 3 | 2023 |
Performance-aware Approximation of Global Channel Pruning for Multitask CNNs H Ye, B Zhang, T Chen, J Fan, B Wang IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 | 17 | 2023 |
A Closer Look at Few-Shot 3D Point Cloud Classification C Ye, H Zhu, B Zhang, T Chen International Journal of Computer Vision (IJCV), 2023 | 8 | 2023 |
Internlm: A multilingual language model with progressively enhanced capabilities Team, InternLM 2023-01-06)[2023-09-27]. https://github. com/InternLM/InternLM, 2023 | 131 | 2023 |
Generative Diffusion Prior for Unified Image Restoration and Enhancement B Fei, Z Lyu, L Pan, J Zhang, W Yang, T Luo, B Zhang, B Dai IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 73 | 2023 |
Uni3d: A unified baseline for multi-dataset 3d object detection B Zhang, J Yuan, B Shi, T Chen, Y Li, Y Qiao IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 25 | 2023 |
Bi3D: Bi-domain Active Learning for Cross-domain 3D Object Detection J Yuan, B Zhang, X Yan, T Chen, B Shi, Y Li, Y Qiao IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 15 | 2023 |