Planting a seed of vision in large language model Y Ge, Y Ge, Z Zeng, X Wang, Y Shan Technical Report, 2023 | 47 | 2023 |
Contrastive quantization with code memory for unsupervised image retrieval J Wang, Z Zeng, B Chen, T Dai, ST Xia AAAI'22 Oral, Proceedings of the AAAI Conference on Artificial Intelligence …, 2022 | 43 | 2022 |
Making llama see and draw with seed tokenizer Y Ge, S Zhao, Z Zeng, Y Ge, C Li, X Wang, Y Shan ICLR 2024, 2024 | 40 | 2024 |
MISSRec: Pre-training and Transferring Multi-modal Interest-aware Sequence Representation for Recommendation J Wang, Z Zeng, Y Wang, Y Wang, X Lu, T Li, J Yuan, R Zhang, HT Zheng, ... ACM MM'23, 2023 | 21 | 2023 |
SwinFGHash: Fine-grained Image Retrieval via Transformer-based Hashing Network D Lu, J Wang, Z Zeng, B Chen, S Wu, ST Xia BMVC'21, 32nd British Machine Vision Conference, 2021 | 16 | 2021 |
Pyramid hybrid pooling quantization for efficient fine-grained image retrieval Z Zeng, J Wang, B Chen, T Dai, ST Xia, Z Wang Pattern Recognition Letters, 2024 | 14 | 2024 |
Contrastive Masked Autoencoders for Self-Supervised Video Hashing Y Wang, J Wang, B Chen, Z Zeng, ST Xia AAAI'23, Proceedings of the AAAI Conference on Artificial Intelligence 37 (3 …, 2023 | 13 | 2023 |
Hybrid Contrastive Quantization for Efficient Cross-View Video Retrieval J Wang, B Chen, D Liao, Z Zeng, G Li, ST Xia, J Xu WWW'22, Proceedings of the ACM Web Conference 2022, 3020-3030, 2022 | 9 | 2022 |
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge Z Zeng, Y Ge, X Liu, B Chen, P Luo, ST Xia, Y Ge CVPR'23, Proceedings of the IEEE/CVF Conference on Computer Vision and …, 2023 | 8 | 2023 |
Hugs Are Better Than Handshakes: Unsupervised Cross-Modal Transformer Hashing with Multi-granularity Alignment J Wang, Z Zeng, B Chen, Y Wang, D Liao, G Li, Y Wang, ST Xia BMVC'22 Oral, 33rd British Machine Vision Conference, 2022 | 7 | 2022 |
TVTSv2: Learning Out-of-the-box Spatiotemporal Visual Representations at Scale Z Zeng, Y Ge, Z Tong, X Liu, ST Xia, Y Shan Technical Report, 2023 | 5 | 2023 |
Motion-Aware Graph Reasoning Hashing for Self-supervised Video Retrieval Z Zeng, J Wang, B Chen, Y Wang, ST Xia BMVC'22, 33rd British Machine Vision Conference, 2022 | 5 | 2022 |
GMMFormer: Gaussian-Mixture-Model based Transformer for Efficient Partially Relevant Video Retrieval Y Wang, J Wang, B Chen, Z Zeng, ST Xia AAAI'24, Proceedings of the AAAI Conference on Artificial Intelligence, 2024 | 1 | 2024 |
Hugs Bring Double Benefits: Unsupervised Cross-Modal Hashing with Multi-granularity Aligned Transformers J Wang, Z Zeng, B Chen, Y Wang, D Liao, G Li, Y Wang, ST Xia International Journal of Computer Vision, 1-33, 2024 | | 2024 |
ConCAP: Contrastive Context-Aware Prompt for Resource-hungry Action Recognition H Zhang, Z Zeng, Q Zhao, Z Zhai ICME'23 Oral, 2023 IEEE International Conference on Multimedia and Expo …, 2023 | | 2023 |