Beyond rnns: Positional self-attention with co-attention for video question answering X Li, J Song, L Gao, X Liu, W Huang, X He, C Gan Proceedings of the AAAI conference on artificial intelligence 33 (01), 8658-8665, 2019 | 300 | 2019 |
Hierarchical LSTMs with adaptive attention for visual captioning L Gao, X Li, J Song, HT Shen IEEE transactions on pattern analysis and machine intelligence 42 (5), 1112-1131, 2019 | 279 | 2019 |
Self-supervised video hashing with hierarchical binary auto-encoder J Song, H Zhang, X Li, L Gao, M Wang, R Hong IEEE Transactions on Image Processing 27 (7), 3210-3221, 2018 | 270 | 2018 |
Learnable aggregating net with diversity learning for video question answering X Li, L Gao, X Wang, W Liu, X Xu, HT Shen, J Song Proceedings of the 27th ACM international conference on multimedia, 1166-1174, 2019 | 72 | 2019 |
Rich visual knowledge-based augmentation network for visual question answering L Zhang, S Liu, D Liu, P Zeng, X Li, J Song, L Gao IEEE Transactions on Neural Networks and Learning Systems 32 (10), 4362-4373, 2020 | 59 | 2020 |
Residual attention-based LSTM for video captioning X Li, Z Zhou, L Chen, L Gao World Wide Web 22, 621-636, 2019 | 44 | 2019 |
Text-instance graph: Exploring the relational semantics for text-based visual question answering X Li, B Wu, J Song, L Gao, P Zeng, C Gan Pattern Recognition 124, 108455, 2022 | 30 | 2022 |
Scenario-aware recurrent transformer for goal-directed video captioning X Man, D Ouyang, X Li, J Song, J Shao ACM Transactions on Multimedia Computing, Communications, and Applications …, 2022 | 30 | 2022 |
Visual commonsense-aware representation network for video captioning P Zeng, H Zhang, L Gao, X Li, J Qian, HT Shen IEEE Transactions on Neural Networks and Learning Systems, 2023 | 14 | 2023 |
Kernel based latent semantic sparse hashing for large-scale retrieval from heterogeneous data sources X Li, L Gao, X Xu, J Shao, F Shen, J Song Neurocomputing 253, 89-96, 2017 | 14 | 2017 |
Generalized pyramid co-attention with learnable aggregation net for video question answering L Gao, T Chen, X Li, P Zeng, L Zhao, YF Li Pattern Recognition 120, 108145, 2021 | 9 | 2021 |
Exploring contextual-aware representation and linguistic-diverse expression for visual dialog X Li, L Gao, L Zhao, J Song Proceedings of the 29th ACM International Conference on Multimedia, 4911-4919, 2021 | 4 | 2021 |
Relation-aware aggregation network with auxiliary guidance for text-based person search P Zeng, S Jing, J Song, K Fan, X Li, L We, Y Guo World Wide Web, 1-18, 2022 | 3 | 2022 |
You should know more: Learning external knowledge for visual dialog L Zhao, H Zhang, X Li, S Yang, Y Song Neurocomputing 488, 54-65, 2022 | 3 | 2022 |