PointCLIP: Point Cloud Understanding by CLIP R Zhang*, Z Guo*, W Zhang, K Li, X Miao, B Cui, Y Qiao, P Gao, H Li CVPR 2022, 8552-8562, 2022 | 340 | 2022 |
Point-M2AE: Multi-scale Masked Autoencoders for Hierarchical Point Cloud Pre-training R Zhang, Z Guo, P Gao, R Fang, B Zhao, D Wang, Y Qiao, H Li NeurIPS 2022, 2022 | 189 | 2022 |
Personalize Segment Anything Model with One Shot R Zhang, Z Jiang, Z Guo, S Yan, J Pan, H Dong, P Gao, H Li ICLR 2024, 2023 | 112 | 2023 |
MonoDETR: Depth-guided Transformer for Monocular 3D Object Detection R Zhang, H Qiu, T Wang, Z Guo, Z Cui, Y Qiao, H Li, P Gao ICCV 2023, 9155-9166, 2023 | 111 | 2023 |
CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention Z Guo, R Zhang, L Qiu, X Ma, X Miao, X He, B Cui AAAI 2023 Oral, 2022 | 70 | 2022 |
PointCLIP V2: Prompting CLIP and GPT for Powerful 3D Open-world Learning X Zhu, R Zhang, B He, Z Guo, Z Zeng, Z Qin, S Zhang, P Gao ICCV 2023, 2023 | 67 | 2023 |
Parameter is Not All You Need: Starting from Non-parametric Networks for 3D Point Cloud Analysis R Zhang, L Wang, Z Guo, Y Wang, P Gao, H Li, J Shi CVPR 2023, 2023 | 66* | 2023 |
ImageBind-LLM: Multi-modality Instruction Tuning J Han, R Zhang, W Shao, P Gao, P Xu, H Xiao, K Zhang, C Liu, S Wen, ... arXiv preprint arXiv:2309.03905, 2023 | 65 | 2023 |
Point-Bind & Point-LLM: Aligning Point Cloud with Multi-modality for 3D Understanding, Generation, and Instruction Following Z Guo, R Zhang, X Zhu, Y Tang, X Ma, J Han, K Chen, P Gao, X Li, H Li, ... arXiv preprint arXiv:2309.00615, 2023 | 57 | 2023 |
Can Language Understand Depth? R Zhang, Z Zeng, Z Guo, Y Li ACM MM 2022, 6868-6874, 2022 | 48 | 2022 |
VT-CLIP: Enhancing Vision-Language Models with Visual-guided Texts L Qiu, R Zhang, Z Guo, Z Zeng, Y Li, G Zhang arXiv preprint arXiv:2112.02399, 2021 | 43 | 2021 |
Joint-MAE: 2D-3D Joint Masked Autoencoders for 3D Point Cloud Pre-training Z Guo, R Zhang, L Qiu, X Li, PA Heng IJCAI 2023, 2023 | 36 | 2023 |
Mathverse: Does your multi-modal llm truly see the diagrams in visual math problems? R Zhang, D Jiang, Y Zhang, H Lin, Z Guo, P Qiu, A Zhou, P Lu, KW Chang, ... ECCV 2024, 2024 | 28 | 2024 |
DS-Point: A Dual-Scale 3D Framework for Point Cloud Understanding R Zhang*, Z Zeng*, Z Guo*, B Chen, G Zhang, X Liu SMC 2023, 5046-5051, 2023 | 25* | 2023 |
Nearest Neighbors Meet Deep Neural Networks for Point Cloud Analysis R Zhang, L Wang, Z Guo, J Shi WACV 2023, 1246-1255, 2023 | 14 | 2023 |
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation S Yan*, R Zhang*, Z Guo*, W Chen, W Zhang, H Li, Y Qiao, Z He, P Gao AAAI 2024, 2023 | 12 | 2023 |
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation X Zhu, R Zhang, B He, Z Guo, J Liu, H Xiao, C Fu, H Dong, P Gao CVPR 2024, 2024 | 3* | 2024 |
LLM-Assisted Multi-Teacher Continual Learning for Visual Question Answering in Robotic Surgery K Chen, Y Du, T You, M Islam, Z Guo, Y Jin, G Chen, PA Heng ICRA 2024, 2024 | 3 | 2024 |
SignVTCL: Multi-Modal Continuous Sign Language Recognition Enhanced by Visual-Textual Contrastive Learning H Chen, J Wang, Z Guo, J Li, D Zhou, B Wu, C Guan, G Chen, PA Heng BMVC 2024, 2024 | 1 | 2024 |
MAVIS: Mathematical Visual Instruction Tuning R Zhang, X Wei, D Jiang, Y Zhang, Z Guo, C Tong, J Liu, A Zhou, B Wei, ... arXiv preprint arXiv:2407.08739, 2024 | | 2024 |