SegFormer: Simple and efficient design for semantic segmentation with transformers E Xie, W Wang, Z Yu, A Anandkumar, JM Alvarez, P Luo Advances in Neural Information Processing Systems (NeurIPS) 34, 12077-12090, 2021 | 3580 | 2021 |
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao IEEE/CVF International Conference on Computer Vision (ICCV), 568-578, 2021 | 3517 | 2021 |
Selective Kernel Networks X Li, W Wang, X Hu, J Yang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019 | 2487 | 2019 |
PVT v2: Improved Baselines with Pyramid Vision Transformer W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao Computational Visual Media Journal (CVMJ), 2022 | 1114 | 2022 |
Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection X Li, W Wang, L Wu, S Chen, X Hu, J Li, J Tang, J Yang Advances in Neural Information Processing Systems (NeurIPS), 2020 | 886 | 2020 |
BEVFormer: Learning Bird's-Eye-View Representation from Multi-Camera Images via Spatiotemporal Transformers Z Li, W Wang, H Li, E Xie, C Sima, T Lu, Q Yu, J Dai European Conference on Computer Vision (ECCV), 2022 | 778 | 2022 |
Shape Robust Text Detection with Progressive Scale Expansion Network W Wang, E Xie, X Li, W Hou, T Lu, G Yu, S Shao IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019 | 741 | 2019 |
Polarmask: Single shot instance segmentation with polar representation E Xie, P Sun, X Song, W Wang, X Liu, D Liang, C Shen, P Luo IEEE/CVF conference on computer vision and pattern recognition (CVPR), 12193 …, 2020 | 648 | 2020 |
Efficient and Accurate Arbitrary-Shaped Text Detection with Pixel Aggregation Network W Wang, E Xie, X Song, Y Zang, W Wang, T Lu, G Yu, C Shen IEEE/CVF International Conference on Computer Vision (ICCV), 2019 | 524 | 2019 |
Internimage: Exploring large-scale vision foundation models with deformable convolutions W Wang, J Dai, Z Chen, Z Huang, Z Li, X Zhu, X Hu, T Lu, L Lu, H Li, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 432 | 2023 |
Vision transformer adapter for dense predictions Z Chen, Y Duan, W Wang, J He, T Lu, J Dai, Y Qiao International Conference on Learning Representation (ICLR), 2023 | 408 | 2023 |
Detco: Unsupervised Contrastive Learning for Object Detection E Xie, J Ding, W Wang, X Zhan, H Xu, Z Li, P Luo IEEE/CVF International Conference on Computer Vision (ICCV), 2021 | 343 | 2021 |
Goal-oriented Autonomous Driving Y Hu, J Yang, L Chen, K Li, C Sima, X Zhu, S Chai, S Du, T Lin, W Wang, ... IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023 | 285* | 2023 |
Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers B Dong, W Wang, DP Fan, J Li, H Fu, L Shao CAAI Artificial Intelligence Research (CAAI AIR), 2023 | 275 | 2023 |
Videochat: Chat-centric video understanding KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2305.06355, 2023 | 265 | 2023 |
Generalized Focal Loss V2: Learning Reliable Localization Quality Estimation for Dense Object Detection X Li, W Wang, X Hu, J Li, J Tang, J Yang IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 | 251 | 2021 |
Visionllm: Large language model is also an open-ended decoder for vision-centric tasks W Wang, Z Chen, X Chen, J Wu, X Zhu, G Zeng, P Luo, T Lu, J Zhou, ... Advances in Neural Information Processing Systems (NeurIPS), 2023 | 222 | 2023 |
Scene Text Image Super-Resolution in the Wild W Wang, E Xie, X Liu, W Wang, D Liang, C Shen, X Bai European Conference on Computer Vision (ECCV), 650-666, 2020 | 162* | 2020 |
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers Z Li, W Wang, E Xie, Z Yu, A Anandkumar, JM Alvarez, T Lu, P Luo IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022 | 142* | 2022 |
Differentiable Hierarchical Graph Grouping for Multi-Person Pose Estimation S Jin, W Liu, E Xie, W Wang, C Qian, W Ouyang, P Luo European Conference on Computer Vision (ECCV), 2020 | 138 | 2020 |