DynamicViT: Efficient vision transformers with dynamic token sparsification Y Rao, W Zhao, B Liu, J Lu, J Zhou, CJ Hsieh Advances in neural information processing systems 34, 13937-13949, 2021 | 544 | 2021 |
DenseCLIP: Language-guided dense prediction with context-aware prompting Y Rao*, W Zhao*, G Chen, Y Tang, Z Zhu, G Huang, J Zhou, J Lu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 420 | 2022 |
Global filter networks for image classification Y Rao*, W Zhao*, Z Zhu, J Lu, J Zhou Advances in neural information processing systems 34, 980-993, 2021 | 391 | 2021 |
HorNet: Efficient high-order spatial interactions with recursive gated convolutions Y Rao*, W Zhao*, Y Tang, J Zhou, SN Lim, J Lu Advances in Neural Information Processing Systems 35, 10353-10366, 2022 | 233 | 2022 |
Unleashing text-to-image diffusion models for visual perception W Zhao*, Y Rao*, Z Liu*, B Liu, J Zhou, J Lu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 100 | 2023 |
UniPC: A unified predictor-corrector framework for fast sampling of diffusion models W Zhao*, L Bai*, Y Rao, J Zhou, J Lu Advances in Neural Information Processing Systems 36, 2024 | 85 | 2024 |
Group-aware contrastive regression for action quality assessment X Yu, Y Rao, W Zhao, J Lu, J Zhou Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 83 | 2021 |
DiffTalk: Crafting diffusion models for generalized audio-driven portraits animation S Shen, W Zhao, Z Meng, W Li, Z Zhu, J Zhou, J Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 51 | 2023 |
Towards interpretable deep metric learning with structural matching W Zhao*, Y Rao*, Z Wang, J Lu, J Zhou Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 49 | 2021 |
DiffTalk: Crafting diffusion models for generalized talking head synthesis S Shen, W Zhao, Z Meng, W Li, Z Zhu, J Zhou, J Lu arXiv preprint arXiv:2301.03786, 2023 | 23 | 2023 |
Dynamic spatial sparsification for efficient vision transformers and convolutional neural networks Y Rao, Z Liu, W Zhao, J Zhou, J Lu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (9), 10883 …, 2023 | 20 | 2023 |
DiffSwap: High-fidelity and controllable face swapping via 3D-aware masked diffusion W Zhao, Y Rao, W Shi, Z Liu, J Zhou, J Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 18 | 2023 |
GFNet: Global filter networks for visual recognition Y Rao, W Zhao, Z Zhu, J Zhou, J Lu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (9), 10960 …, 2023 | 12 | 2023 |
AMixer: Adaptive weight mixing for self-attention free vision transformers Y Rao, W Zhao, J Zhou, J Lu European Conference on Computer Vision, 50-67, 2022 | 4 | 2022 |
VideoABC: A real-world video dataset for abductive visual reasoning W Zhao, Y Rao, Y Tang, J Zhou, J Lu IEEE Transactions on Image Processing 31, 6048-6061, 2022 | 4 | 2022 |
StableSwap: Stable Face Swapping in a Shared and Controllable Latent Space Y Zhu, W Zhao, Y Tang, Y Rao, J Zhou, J Lu IEEE Transactions on Multimedia, 2024 | 1 | 2024 |
FlowIE: Efficient Image Enhancement via Rectified Flow Y Zhu*, W Zhao*, A Li, Y Tang, J Zhou, J Lu arXiv preprint arXiv:2406.00508, 2024 | | 2024 |
DPMesh: Exploiting Diffusion Prior for Occluded Human Mesh Recovery Y Zhu, A Li, Y Tang, W Zhao, J Zhou, J Lu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | | 2024 |
DIML: Deep Interpretable Metric Learning via Structural Matching W Zhao, Y Rao, J Zhou, J Lu IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023 | | 2023 |