Vision Transformer with Deformable Attention | Z Xia*, X Pan*, S Song, LE Li, G Huang | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2022 | 645 | 2022 |
On the Integration of Self-Attention and Convolution | X Pan, C Ge, R Lu, S Song, G Chen, Z Huang, G Huang | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2022, 2022 | 449 | 2022 |
3D Object Detection with Pointformer | X Pan, Z Xia, S Song, LE Li, G Huang | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2021, 2021 | 448 | 2021 |
Implicit Semantic Data Augmentation for Deep Networks | Y Wang*, X Pan*, S Song, H Zhang, C Wu, G Huang | Advances in Neural Information Processing Systems (NeurIPS) 2019, 2019 | 221 | 2019 |
Regularizing Deep Networks with Semantic Data Augmentation | Y Wang, G Huang, S Song, X Pan, Y Xia, C Wu | IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021 | 179 | 2021 |
FLatten Transformer: Vision Transformer using Focused Linear Attention | D Han*, X Pan*, Y Han, S Song, G Huang | International Conference on Computer Vision (ICCV) 2023, 2023 | 175 | 2023 |
ActiveNeRF: Learning where to See with Uncertainty Estimation | X Pan, Z Lai, S Song, G Huang | European Conference on Computer Vision (ECCV) 2022, 2022 | 106 | 2022 |
Slide-Transformer: Hierarchical Vision Transformer with Local Self-Attention | X Pan, T Ye, Z Xia, S Song, G Huang | IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023, 2023 | 65 | 2023 |
Contrastive Language-Image Pre-Training with Knowledge Graphs | X Pan, T Ye, D Han, S Song, G Huang | Advances in Neural Information Processing Systems (NeurIPS) 2022, 2022 | 44 | 2022 |
GSVA: Generalized Segmentation via Multimodal Large Language Models | Z Xia, D Han, Y Han, X Pan, S Song, G Huang | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 37 | 2024 |
Dynamic Perceiver for Efficient Visual Recognition | Y Han, D Han, Z Liu, Y Wang, X Pan, Y Pu, C Deng, J Feng, S Song, ... | International Conference on Computer Vision (ICCV) 2023, 2023 | 35 | 2023 |
A Unified Framework for Convolution-based Graph Neural Networks | X Pan, S Song, G Huang | URL https://openreview.net/forum, 2021 | 23* | 2021 |
DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Z Xia, X Pan, S Song, LE Li, G Huang | arXiv preprint arXiv:2309.01430, 2023 | 21 | 2023 |
Joint Representation Learning for Text and 3D Point Cloud | R Huang*, X Pan*, H Zheng, H Jiang, Z Xie, S Song, G Huang | arXiv preprint arXiv:2301.07584, 2023 | 16 | 2023 |
Budgeted Training for Vision Transformer | Z Xia*, X Pan*, X Jin, Y He, S Song, G Huang | International Conference on Learning Representations (ICLR) 2023, 2023 | 9* | 2023 |
Bridging the Divide: Reconsidering Softmax and Linear Attention | D Han, Y Pu, Z Xia, Y Han, X Pan, X Li, J Lu, S Song, G Huang | arXiv preprint arXiv:2412.06590, 2024 | 2 | 2024 |
PLAM: A Plug-in Module for Flexible Graph Attention Learning | X Pan, S Song, Y Chen, L Wang, G Huang | Neurocomputing 480, 76-88, 2022 | 2 | 2022 |
Method and apparatus for computer vision processing | C Ge, G Huang, R Lu, S Shiji, X Pan, H Yang | US Patent App. 18/572,377, 2024 | | 2024 |