Tokens-to-Token ViT: Training vision transformers from scratch on ImageNet L Yuan, Y Chen, T Wang, W Yu, Y Shi, ZH Jiang, FEH Tay, J Feng, S Yan Proceedings of the IEEE/CVF International Conference on Computer Vision, 558-567, 2021 | 1964 | 2021 |
Revisiting knowledge distillation via label smoothing regularization L Yuan, FEH Tay, G Li, T Wang, J Feng CVPR Oral 2020, 3903-3911, 2020 | 651* | 2020 |
Distilling object detectors with fine-grained feature imitation T Wang, L Yuan, X Zhang, J Feng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 401 | 2019 |
Masked Autoencoders for Point Cloud Self-supervised Learning Y Pang, W Wang, FEH Tay, W Liu, Y Tian, L Yuan ECCV, 2022 | 339 | 2022 |
Central similarity quantization for efficient image and video retrieval L Yuan, T Wang, X Zhang, FEH Tay, Z Jie, W Liu, J Feng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020 | 308 | 2020 |
VOLO: Vision outlooker for visual recognition L Yuan, Q Hou, Z Jiang, J Feng, S Yan IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2023 | 276 | 2023 |
All tokens matter: Token labeling for training better vision transformers ZH Jiang, Q Hou, L Yuan, D Zhou, Y Shi, X Jin, A Wang, J Feng Advances in Neural Information Processing Systems 34, 2021 | 232* | 2021 |
Vision permutator: A permutable MLP-like architecture for visual recognition Q Hou, Z Jiang, L Yuan, MM Cheng, S Yan, J Feng IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 2022 | 168 | 2022 |
Few-shot adaptive Faster R-CNN T Wang, X Zhang, L Yuan, J Feng Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019 | 163 | 2019 |
ChatLaw: Open-Source Legal Large Language Model with Integrated External Knowledge Bases J Cui, Z Li, Y Yan, B Chen, L Yuan arXiv preprint arXiv:2306.16092, 2023 | 145 | 2023 |
Spikformer: When Spiking Neural Network Meets Transformer Z Zhou, Y Zhu, C He, Y Wang, S Yan, Y Tian, L Yuan ICLR, 2023 | 129 | 2023 |
Cycle-SUM: Cycle-consistent adversarial LSTM networks for unsupervised video summarization L Yuan, FEH Tay, P Li, L Zhou, J Feng Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 9143-9150, 2019 | 127 | 2019 |
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection B Lin, B Zhu, Y Ye, M Ning, P Jin, L Yuan arXiv preprint arXiv:2311.10122, 2023 | 112 | 2023 |
Exploring global diverse attention via pairwise temporal relation for video summarization P Li, Q Ye, L Zhang, L Yuan, X Xu, L Shao Pattern Recognition 111, 107677, 2021 | 92 | 2021 |
PnP-DETR: towards efficient visual analysis with transformers T Wang, L Yuan, Y Chen, J Feng, S Yan Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 74 | 2021 |
Refiner: Refining self-attention for vision transformers D Zhou, Y Shi, B Kang, W Yu, Z Jiang, Y Li, X Jin, Q Hou, J Feng arXiv preprint arXiv:2106.03714, 2021 | 61 | 2021 |
Improving Vision Transformers by Revisiting High-frequency Components J Bai, L Yuan, ST Xia, S Yan, Z Li, W Liu ECCV, 2022 | 59 | 2022 |
LLM lies: Hallucinations are not bugs, but features as adversarial examples JY Yao, KP Ning, ZH Liu, MN Ning, L Yuan arXiv preprint arXiv:2310.01469, 2023 | 55 | 2023 |
Unsupervised video summarization with cycle-consistent adversarial LSTM networks L Yuan, FEH Tay, P Li, J Feng IEEE Transactions on Multimedia 22 (10), 2711-2722, 2019 | 52 | 2019 |
LanguageBind: Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment B Zhu, B Lin, M Ning, Y Yan, J Cui, HF Wang, Y Pang, W Jiang, J Zhang, ... ICLR, 2024 | 44 | 2024 |