Action recognition using context and appearance distribution features X Wu, D Xu, L Duan, J Luo CVPR 2011, 489-496, 2011 | 285 | 2011 |
Discriminative human action recognition in the learned hierarchical manifold space L Han, X Wu, W Liang, G Hou, Y Jia Image and Vision Computing 28 (5), 836-849, 2010 | 129 | 2010 |
Joint syntax representation learning and visual cue translation for video captioning J Hou, X Wu, W Zhao, J Luo, Y Jia Proceedings of the IEEE/CVF international conference on computer vision …, 2019 | 100 | 2019 |
View-invariant action recognition using latent kernelized structural SVM X Wu, Y Jia Computer Vision–ECCV 2012: 12th European Conference on Computer Vision …, 2012 | 80 | 2012 |
Learning normal patterns via adversarial attention-based autoencoder for abnormal event detection in videos H Song, C Sun, X Wu, M Chen, Y Jia IEEE Transactions on Multimedia 22 (8), 2138-2148, 2019 | 79 | 2019 |
Memcap: Memorizing style knowledge for image captioning W Zhao, X Wu, X Zhang Proceedings of the AAAI Conference on Artificial Intelligence 34 (07), 12984 …, 2020 | 77 | 2020 |
Cross-view action recognition over heterogeneous feature spaces X Wu, H Wang, C Liu, Y Jia Proceedings of the IEEE International Conference on Computer Vision, 609-616, 2013 | 68 | 2013 |
Joint commonsense and relation reasoning for image and video captioning J Hou, X Wu, X Zhang, Y Qi, Y Jia, J Luo Proceedings of the AAAI conference on artificial intelligence 34 (07), 10973 …, 2020 | 67* | 2020 |
Action recognition using multilevel features and latent structural SVM X Wu, D Xu, L Duan, J Luo, Y Jia IEEE transactions on Circuits and Systems for Video Technology 23 (8), 1422-1431, 2013 | 65 | 2013 |
Content-attention representation by factorized action-scene network for action recognition J Hou, X Wu, Y Sun, Y Jia IEEE Transactions on Multimedia 20 (6), 1537-1547, 2017 | 56 | 2017 |
Cross-domain image captioning via cross-modal retrieval and model adaptation W Zhao, X Wu, J Luo IEEE Transactions on Image Processing 30, 1180-1192, 2020 | 47 | 2020 |
Domain adversarial reinforcement learning for partial domain adaptation J Chen, X Wu, L Duan, S Gao IEEE Transactions on Neural Networks and Learning Systems 33 (2), 539-553, 2020 | 46 | 2020 |
Incremental discriminative-analysis of canonical correlations for action recognition X Wu, W Liang, Y Jia 2009 IEEE 12th international conference on computer vision, 2035-2041, 2009 | 39* | 2009 |
Boosting entity-aware image captioning with multi-modal knowledge graph W Zhao, X Wu IEEE Transactions on Multimedia, 2023 | 37 | 2023 |
Temporal action localization in untrimmed videos using action pattern trees H Song, X Wu, B Zhu, Y Wu, M Chen, Y Jia IEEE transactions on multimedia 21 (3), 717-730, 2018 | 31 | 2018 |
Spatial–temporal relation reasoning for action prediction in videos X Wu, R Wang, J Hou, H Lin, J Luo International Journal of Computer Vision 129 (5), 1484-1505, 2021 | 30 | 2021 |
Exploiting images for video recognition: Heterogeneous feature augmentation via symmetric adversarial learning F Yu, X Wu, J Chen, L Duan IEEE Transactions on Image Processing 28 (11), 5308-5321, 2019 | 29 | 2019 |
Multi-modal dependency tree for video captioning W Zhao, X Wu, J Luo Advances in Neural Information Processing Systems 34, 6634-6645, 2021 | 27 | 2021 |
Exploiting informative video segments for temporal action localization C Sun, H Song, X Wu, Y Jia, J Luo IEEE Transactions on Multimedia 24, 274-287, 2021 | 27 | 2021 |
Video annotation via image groups from the web H Wang, X Wu, Y Jia IEEE transactions on multimedia 16 (5), 1282-1291, 2014 | 24 | 2014 |