Beyond frame-level CNN: saliency-aware 3-D CNN with LSTM for video action recognition X Wang, L Gao, J Song, H Shen IEEE signal processing letters 24 (4), 510-514, 2016 | 304 | 2016 |
Two-stream 3-D ConvNet fusion for action recognition in videos with arbitrary size and length X Wang, L Gao, P Wang, X Sun, X Liu IEEE Transactions on Multimedia 20 (3), 634-644, 2017 | 271 | 2017 |
From general to specific: Informative scene graph generation via balance adjustment Y Guo, L Gao, X Wang, Y Hu, X Xu, X Lu, HT Shen, J Song Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 93 | 2021 |
Learnable aggregating net with diversity learning for video question answering X Li, L Gao, X Wang, W Liu, X Xu, HT Shen, J Song Proceedings of the 27th ACM International Conference on Multimedia, 1166-1174, 2019 | 72 | 2019 |
Deep appearance and motion learning for egocentric activity recognition X Wang, L Gao, J Song, X Zhen, N Sebe, HT Shen Neurocomputing 275, 438-447, 2018 | 52 | 2018 |
Fused GRU with semantic-temporal attention for video captioning L Gao, X Wang, J Song, Y Liu Neurocomputing 395, 222-228, 2020 | 48 | 2020 |
Skeleton-based action recognition via adaptive cross-form learning X Wang, Y Dai, L Gao, J Song Proceedings of the 30th ACM International Conference on Multimedia, 1670-1678, 2022 | 20 | 2022 |
MKE-GCN: Multi-modal knowledge embedded graph convolutional network for skeleton-based action recognition in the wild S Yang, X Wang, L Gao, J Song 2022 IEEE International Conference on Multimedia and Expo (ICME), 01-06, 2022 | 19 | 2022 |
KTN: Knowledge transfer network for learning multiperson 2D-3D correspondences X Wang, L Gao, Y Zhou, J Song, M Wang IEEE Transactions on Circuits and Systems for Video Technology 32 (11), 7732 …, 2022 | 17 | 2022 |
KTN: Knowledge transfer network for multi-person DensePose estimation X Wang, L Gao, J Song, HT Shen Proceedings of the 28th ACM International Conference on Multimedia, 3780-3788, 2020 | 13 | 2020 |
RSGNet: Relation based skeleton graph network for crowded scenes pose estimation Y Dai, X Wang, L Gao, J Song, HT Shen Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 1193-1200, 2021 | 12 | 2021 |
Semantic-aware transfer with instance-adaptive parsing for crowded scenes pose estimation X Wang, L Gao, Y Dai, Y Zhou, J Song Proceedings of the 29th ACM International Conference on Multimedia, 686-694, 2021 | 10 | 2021 |
AMANet: Adaptive multi-path aggregation for learning human 2D-3D correspondences X Wang, Y Guo, J Song, L Gao, HT Shen IEEE Transactions on Multimedia 25, 979-992, 2021 | 9 | 2021 |
KE-RCNN: Unifying knowledge-based reasoning into part-level attribute parsing X Wang, J Song, X Chen, L Cheng, L Gao, HT Shen IEEE Transactions on Cybernetics 53 (11), 7263-7274, 2022 | 8 | 2022 |
X-HRNet: Towards lightweight human pose estimation with spatially unidimensional self-attention Y Zhou, X Wang, X Xu, L Zhao, J Song 2022 IEEE International Conference on Multimedia and Expo (ICME), 01-06, 2022 | 7 | 2022 |
ResParser: Fully convolutional multiple human parsing with representative sets Y Dai, X Chen, X Wang, M Pang, L Gao, HT Shen IEEE Transactions on Multimedia 26, 1384-1394, 2023 | 5 | 2023 |
RepParser: End-to-end multiple human parsing with representative parts X Chen, X Wang, L Gao, J Song arXiv preprint arXiv:2208.12908, 2022 | 4 | 2022 |
Overcoming Data Deficiency for Multi-Person Pose Estimation Y Dai, X Wang, L Gao, J Song, F Zheng, HT Shen IEEE Transactions on Neural Networks and Learning Systems, 2023 | 3 | 2023 |
Technical report: Disentangled action parsing networks for accurate part-level action parsing X Wang, X Chen, L Gao, L Chen, J Song arXiv preprint arXiv:2111.03225, 2021 | 3 | 2021 |
EANet: Towards Lightweight Human Pose Estimation With Effective Aggregation Network B Chen, X Wang, X Chen, Y He, J Song 2023 IEEE International Conference on Multimedia and Expo (ICME), 2639-2644, 2023 | 2 | 2023 |