Adaattn: Revisit attention mechanism in arbitrary neural style transfer S Liu, T Lin, D He, F Li, M Wang, X Li, Z Sun, Q Li, E Ding Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 281 | 2021 |
Multi-label classification with label graph superimposing Y Wang, D He, F Li, X Long, Z Zhou, J Ma, S Wen Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12265 …, 2020 | 188 | 2020 |
Stnet: Local and global spatial-temporal modeling for action recognition D He, Z Zhou, C Gan, F Li, X Liu, Y Li, L Wang, S Wen Proceedings of the AAAI conference on artificial intelligence 33 (01), 8401-8408, 2019 | 161 | 2019 |
Read, watch, and move: Reinforcement learning for temporally grounding natural language descriptions in videos D He, X Zhao, J Huang, F Li, X Liu, S Wen Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8393-8400, 2019 | 153 | 2019 |
Multimodal keyless attention fusion for video classification X Long, C Gan, G Melo, X Liu, Y Li, F Li, S Wen Proceedings of the aaai conference on artificial intelligence 32 (1), 2018 | 141 | 2018 |
Dolg: Single-stage image retrieval with deep orthogonal fusion of local and global features M Yang, D He, M Fan, B Shi, X Xue, F Li, E Ding, J Huang Proceedings of the IEEE/CVF International conference on Computer Vision …, 2021 | 119 | 2021 |
Image inpainting by end-to-end cascaded refinement with mask awareness M Zhu, D He, X Li, C Li, F Li, X Liu, E Ding, Z Zhang IEEE Transactions on Image Processing 30, 4855-4866, 2021 | 103 | 2021 |
Drafting and revision: Laplacian pyramid network for fast high-quality artistic style transfer T Lin, Z Ma, F Li, D He, X Li, E Ding, N Wang, J Li, X Gao Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021 | 86 | 2021 |
Paint transformer: Feed forward neural painting with stroke prediction S Liu, T Lin, D He, F Li, R Deng, X Li, E Ding, H Wang Proceedings of the IEEE/CVF international conference on computer vision …, 2021 | 74 | 2021 |
Mvfnet: Multi-view fusion network for efficient video recognition W Wu, D He, T Lin, F Li, C Gan, E Ding Proceedings of the AAAI conference on artificial intelligence 35 (4), 2943-2951, 2021 | 69 | 2021 |
Learning semantic person image generation by region-adaptive normalization Z Lv, X Li, X Li, F Li, T Lin, D He, W Zuo Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 68 | 2021 |
Temporal modeling approaches for large-scale youtube-8m video understanding F Li, C Gan, X Liu, Y Bian, X Long, Y Li, Z Li, J Zhou, S Wen arXiv preprint arXiv:1707.04555, 2017 | 67 | 2017 |
Revisiting the effectiveness of off-the-shelf temporal modeling approaches for large-scale video classification Y Bian, C Gan, X Liu, F Li, X Long, Y Li, H Qi, J Zhou, S Wen, Y Lin arXiv preprint arXiv:1708.03805, 2017 | 57 | 2017 |
GMSS: Graph-based multi-task self-supervised learning for EEG emotion recognition Y Li, J Chen, F Li, B Fu, H Wu, Y Ji, Y Zhou, Y Niu, G Shi, W Zheng IEEE Transactions on Affective Computing 14 (3), 2512-2525, 2022 | 52 | 2022 |
Predict, prevent, and evaluate: Disentangled text-driven image manipulation empowered by pre-trained vision-language model Z Xu, T Lin, H Tang, F Li, D He, N Sebe, R Timofte, L Van Gool, E Ding Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 42 | 2022 |
Videogen: A reference-guided latent diffusion approach for high definition text-to-video generation X Li, W Chu, Y Wu, W Yuan, F Liu, Q Zhang, F Li, H Feng, E Ding, J Wang arXiv preprint arXiv:2309.00398, 2023 | 33 | 2023 |
Deep concept-wise temporal convolutional networks for action localization X Li, T Lin, X Liu, W Zuo, C Li, X Long, D He, F Li, S Wen, C Gan Proceedings of the 28th ACM International Conference on Multimedia, 4004-4012, 2020 | 33 | 2020 |
Aim 2022 challenge on super-resolution of compressed image and video: Dataset, methods and results R Yang, R Timofte, X Li, Q Zhang, L Zhang, F Liu, D He, F Li, H Zheng, ... European Conference on Computer Vision, 174-202, 2022 | 29 | 2022 |
Uatvr: Uncertainty-adaptive text-video retrieval B Fang, W Wu, C Liu, Y Zhou, Y Song, W Wang, X Shu, X Ji, J Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 28 | 2023 |
Exploiting spatial-temporal modelling and multi-modal fusion for human action recognition D He, F Li, Q Zhao, X Long, Y Fu, S Wen arXiv preprint arXiv:1806.10319, 2018 | 28 | 2018 |