Rmt: Retentive networks meet vision transformers Q Fan, H Huang, M Chen, H Liu, R He CVPR, 2024, 2023 | 31 | 2023 |
Rethinking local perception in lightweight vision transformer Q Fan, H Huang, J Guan, R He arXiv preprint arXiv:2303.17803, 2023 | 31 | 2023 |
Lightweight Vision Transformer with Bidirectional Interaction Q Fan, H Huang, X Zhou, R He NeurIPS, 2023, 2023 | 15 | 2023 |
Video-teller: Enhancing cross-modal generation with fusion and decoupling H Liu, Q Fan, T Liu, L Yang, Y Tao, H Huang, R He, H Yang arXiv preprint arXiv:2310.04991, 2023 | 6 | 2023 |
Video-csr: Complex video digest creation for visual-language models T Liu, Y Tao, H Liu, Q Fan, D Zhou, H Huang, R He, H Yang ACL, 2024, 2023 | 3 | 2023 |
ViTAR: Vision Transformer with Any Resolution Q Fan, Q You, X Han, Y Liu, Y Tao, H Huang, R He, H Yang arXiv preprint arXiv:2403.18361, 2024 | 1 | 2024 |
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning X Han, Y Jian, X Hu, H Liu, Y Wang, Q Fan, Y Ai, H Huang, R He, Z Yang, ... arXiv preprint arXiv:2409.12568, 2024 | | 2024 |
Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer Q Fan, H Huang, M Chen, R He arXiv preprint arXiv:2405.13337, 2024 | | 2024 |
Vision Transformer with Sparse Scan Prior Q Fan, H Huang, M Chen, R He arXiv preprint arXiv:2405.13335, 2024 | | 2024 |
Band-Attention Modulated RetNet for Face Forgery Detection Z Zhang, J Cao, W Yang, Q Fan, K Zhou, R He arXiv preprint arXiv:2404.06022, 2024 | | 2024 |
Network Group Partition and Core Placement Optimization for Neuromorphic Multi-Core and Multi-Chip Systems Y Yang, Q Fan, T Yan, J Pei, G Li IEEE Transactions on Emerging Topics in Computational Intelligence, 2024 | | 2024 |