Image inpainting via generative multi-column convolutional neural networks Y Wang, X Tao, X Qi, X Shen, J Jia Advances in Neural Information Processing Systems, 331-340, 2018 | 376 | 2018 |
Videochat: Chat-centric video understanding KC Li, Y He, Y Wang, Y Li, W Wang, P Luo, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2305.06355, 2023 | 324 | 2023 |
Mat: Mask-aware transformer for large hole image inpainting W Li, Z Lin, K Zhou, L Qi, Y Wang, J Jia Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 256 | 2022 |
Videomae v2: Scaling video masked autoencoders with dual masking L Wang, B Huang, Z Zhao, Z Tong, Y He, Y Wang, Y Wang, Y Qiao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 216 | 2023 |
InternVideo: general video foundation models via generative and discriminative learning Y Wang, K Li, Y Li, Y He, B Huang, Z Zhao, H Zhang, J Xu, Y Liu, Z Wang, ... arXiv preprint arXiv:2212.03191, 2022 | 213 | 2022 |
Wide-context semantic image extrapolation Y Wang, X Tao, X Shen, J Jia Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2019 | 131 | 2019 |
Fast visual object counting via example-based density estimation Y Wang, Y Zou 2016 IEEE international conference on image processing (ICIP), 3653-3657, 2016 | 119 | 2016 |
Internvid: A large-scale video-text dataset for multimodal understanding and generation Y Wang, Y He, Y Li, K Li, J Yu, X Ma, X Li, G Chen, X Chen, Y Wang, C He, ... arXiv preprint arXiv:2307.06942, 2023 | 104 | 2023 |
Lavie: High-quality video generation with cascaded latent diffusion models Y Wang, X Chen, X Ma, S Zhou, Z Huang, Y Wang, C Yang, Y He, J Yu, ... arXiv preprint arXiv:2309.15103, 2023 | 103 | 2023 |
Uniformerv2: Spatiotemporal learning by arming image vits with video uniformer K Li, Y Wang, Y He, Y Li, Y Wang, L Wang, Y Qiao arXiv preprint arXiv:2211.09552, 2022 | 91 | 2022 |
Towards implicit text-guided 3d shape generation Z Liu, Y Wang, X Qi, CW Fu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 89 | 2022 |
VCNet: a robust approach to blind image inpainting Y Wang, YC Chen, X Tao, J Jia European Conference on Computer Vision, 2020 | 83 | 2020 |
Classifying digestive organs in wireless capsule endoscopy images based on deep convolutional neural network Y Zou, L Li, Y Wang, J Yu, Y Li, WJ Deng 2015 IEEE International Conference on Digital Signal Processing (DSP), 1274-1278, 2015 | 83 | 2015 |
Unmasked teacher: Towards training-efficient video foundation models K Li, Y Wang, Y Li, Y Wang, Y He, L Wang, Y Qiao Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 79 | 2023 |
Multi-scale aligned distillation for low-resolution detection L Qi, J Kuen, J Gu, Z Lin, Y Wang, Y Chen, Y Li, J Jia Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 71 | 2021 |
Interngpt: Solving vision-centric tasks by interacting with chatgpt beyond language Z Liu, Y He, W Wang, W Wang, Y Wang, S Chen, Q Zhang, Z Lai, Y Yang, ... arXiv preprint arXiv:2305.05662, 2023 | 66 | 2023 |
Open world entity segmentation L Qi, J Kuen, Y Wang, J Gu, H Zhao, P Torr, Z Lin, J Jia IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (7), 8743-8756, 2022 | 65 | 2022 |
Mvbench: A comprehensive multi-modal video understanding benchmark K Li, Y Wang, Y He, Y Li, Y Wang, Y Liu, Z Wang, J Xu, G Chen, P Luo, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 61 | 2024 |
Image synthesis via semantic composition Y Wang, L Qi, YC Chen, X Zhang, J Jia Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 59 | 2021 |
Learning open-vocabulary semantic segmentation models from natural language supervision J Xu, J Hou, Y Zhang, R Feng, Y Wang, Y Qiao, W Xie Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023 | 57 | 2023 |