FLatten Transformer: Vision Transformer using Focused Linear Attention D Han, X Pan, Y Han, S Song, G Huang IEEE/CVF International Conference on Computer Vision 2023, 2023 | 81 | 2023 |
Pseudo-Q: Generating Pseudo Language Queries for Visual Grounding H Jiang, Y Lin, D Han, S Song, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition 2022, 2022 | 55 | 2022 |
Contrastive Language-Image Pre-Training with Knowledge Graphs X Pan, T Ye, D Han, S Song, G Huang Advances in Neural Information Processing Systems 2022, 2022 | 30 | 2022 |
Dynamic Perceiver for Efficient Visual Recognition Y Han, D Han, Z Liu, Y Wang, X Pan, Y Pu, C Deng, J Feng, S Song, ... IEEE/CVF International Conference on Computer Vision 2023, 2023 | 24 | 2023 |
Agent Attention: On the Integration of Softmax and Linear Attention D Han, T Ye, Y Han, Z Xia, S Song, G Huang European Conference on Computer Vision 2024, 2023 | 13 | 2023 |
GSVA: Generalized Segmentation via Multimodal Large Language Models Z Xia, D Han, Y Han, X Pan, S Song, G Huang IEEE/CVF Conference on Computer Vision and Pattern Recognition 2024, 2024 | 8 | 2024 |