Deep High-Resolution Representation Learning for Human Pose Estimation K Sun, B Xiao, D Liu, J Wang Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 4858 | 2019 |
Deep High-Resolution Representation Learning for Visual Recognition J Wang, K Sun, T Cheng, B Jiang, C Deng, Y Zhao, D Liu, Y Mu, M Tan, ... IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020 | 4316* | 2020 |
Simple baselines for human pose estimation and tracking B Xiao, H Wu, Y Wei Proceedings of the European Conference on Computer Vision (ECCV), 466-481, 2018 | 2127 | 2018 |
CvT: Introducing Convolutions to Vision Transformers H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang ICCV 2021, 2021 | 1889 | 2021 |
Integral human pose regression X Sun, B Xiao, F Wei, S Liang, Y Wei Proceedings of the European Conference on Computer Vision (ECCV), 529-545, 2018 | 929 | 2018 |
HigherHRNet: Scale-Aware Representation Learning for Bottom-Up Human Pose Estimation B Cheng, B Xiao, J Wang, H Shi, TS Huang, L Zhang CVPR, 2020 | 850 | 2020 |
Florence: A New Foundation Model for Computer Vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021 | 744 | 2021 |
Focal attention for long-range interactions in vision transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao Advances in Neural Information Processing Systems 34, 30008-30022, 2021 | 534* | 2021 |
Dynamic Head: Unifying Object Detection Heads with Attentions X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 513 | 2021 |
Interleaved Group Convolutions T Zhang, GJ Qi, B Xiao, J Wang The IEEE International Conference on Computer Vision (ICCV), 4373-4382, 2017 | 395 | 2017 |
Lite-hrnet: A lightweight high-resolution network C Yu, B Xiao, C Gao, L Yuan, L Zhang, N Sang, J Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 349 | 2021 |
Multi-Scale Vision Longformer: A New Vision Transformer for High-Resolution Image Encoding P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao ICCV 2021, 2021 | 332 | 2021 |
Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression Z Geng, K Sun, B Xiao, Z Zhang, J Wang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 275 | 2021 |
DaViT: Dual Attention Vision Transformers M Ding, B Xiao, N Codella, P Luo, J Wang, L Yuan ECCV, 2022 | 234 | 2022 |
Efficient Self-supervised Vision Transformers for Representation Learning C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao ICLR, 2021 | 210 | 2021 |
Unified contrastive learning in image-text-label space J Yang, C Li, P Zhang, B Xiao, C Liu, L Yuan, J Gao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 173 | 2022 |
TinyViT: Fast Pretraining Distillation for Small Vision Transformers K Wu, J Zhang, H Peng, M Liu, B Xiao, J Fu, L Yuan ECCV, 2022 | 145 | 2022 |
MiniViT: Compressing Vision Transformers with Weight Multiplexing J Zhang, H Peng, K Wu, M Liu, B Xiao, J Fu, L Yuan Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 102 | 2022 |
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 2024 | 89 | 2024 |
Mariana: Tencent deep learning platform and its applications Y Zou, X Jin, Y Li, Z Guo, E Wang, B Xiao Proceedings of the VLDB Endowment 7 (13), 1772-1777, 2014 | 66 | 2014 |