Ocr-oriented master object for text image captioning W Tang, Z Hu, Z Song, R Hong Proceedings of the 2022 International Conference on Multimedia Retrieval, 39-43, 2022 | 9 | 2022 |
Efficient and Self-adaptive Rationale Knowledge Base for Visual Commonsense Reasoning Z Song, Z Hu, R Hong Multimedia Systems 29 (5), 3017–3026, 2023 | 6 | 2023 |
How to Use Language Expert to Assist Inference for Visual Commonsense Reasoning Z Song, W Hu, H Ye, R Hong 2023 IEEE International Conference on Data Mining Workshops (ICDMW), 521-527, 2023 | 1 | 2023 |
Grid Feature Jigsaw for Self-supervised Image Clustering Z Song, Z Hu, R Hong 2023 International Joint Conference on Neural Networks (IJCNN), 1-7, 2023 | 1 | 2023 |
Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning Z Song, Z Hu, Y Zhou, Y Zhao, R Hong, M Wang IEEE Transactions on Multimedia, 2024 | | 2024 |
Grid Jigsaw Representation with CLIP: A New Perspective on Image Clustering Z Song, Z Hu, R Hong arXiv preprint arXiv:2310.17869, 2023 | | 2023 |
Dual Video Summarization: From Frames to Captions Z Hu, Z Wang, Z Song, R Hong 2023 International Joint Conference on Artificial Intelligence (IJCAI), 846-854, 2023 | | 2023 |