Region-object relation-aware dense captioning via transformer Z Shao, J Han, D Marnerides, K Debattista IEEE Transactions on Neural Networks and Learning Systems, 2022 | 130 | 2022 |
Textual context-aware dense captioning with diverse words Z Shao, J Han, K Debattista, Y Pang IEEE Transactions on Multimedia 25, 8753-8766, 2023 | 51 | 2023 |
LSTM-based multi-label video event detection AA Liu, Z Shao, Y Wong, J Li, YT Su, M Kankanhalli Multimedia Tools and Applications 78, 677-695, 2019 | 42 | 2019 |
ESGN: Efficient stereo geometry network for fast 3D object detection A Gao, Y Pang, J Nie, Z Shao, J Cao, Y Guo, X Li IEEE Transactions on Circuits and Systems for Video Technology, 2022 | 22 | 2022 |
DCMSTRD: end-to-end dense captioning via multi-scale transformer decoding Z Shao, J Han, K Debattista, Y Pang IEEE Transactions on Multimedia, 2024 | 16 | 2024 |
Deep intra-image contrastive learning for weakly supervised one-step person search J Wang, Y Pang, J Cao, H Sun, Z Shao, X Li Pattern Recognition 147, 110047, 2024 | 13 | 2024 |
Attentive alignment network for multispectral pedestrian detection N Chen, J Xie, J Nie, J Cao, Z Shao, Y Pang Proceedings of the 31st ACM international conference on multimedia, 3787-3795, 2023 | 12 | 2023 |
View-target relation-guided unsupervised 2D image-based 3D model retrieval via transformer J Chang, L Zhang, Z Shao Multimedia Systems 29 (6), 3891-3901, 2023 | 9 | 2023 |
Illumination-guided transformer-based network for multispectral pedestrian detection F Chu, J Cao, Z Shao, Y Pang CAAI International conference on artificial intelligence, 343-355, 2022 | 8 | 2022 |
Reinforced pedestrian attribute recognition with group optimization reward Z Ji, Z Hu, Y Wang, Z Shao, Y Pang Image and Vision Computing 128, 104585, 2022 | 6 | 2022 |
Toward Generalizable Multispectral Pedestrian Detection F Chu, J Cao, Z Song, Z Shao, Y Pang, X Li IEEE Transactions on Intelligent Transportation Systems, 2023 | 4 | 2023 |
Adaptive semantic transfer network for unsupervised 2D image-based 3D model retrieval D Song, Y Yang, W Li, Z Shao, W Nie, X Li, AA Liu Computer Vision and Image Understanding 238, 103858, 2024 | 3 | 2024 |
Multi-stage reasoning on introspecting and revising bias for visual question answering L An-An, L Zimu, X Ning, L Min, Y Chenggang, Z Bolun, L Bo, D Yulong, ... ACM Transactions on the Web 18 (4), 1-13, 2024 | 2 | 2024 |
Reuse, remanufacturing and recycling in the steel sector C Davis, R Hall, S Hazra, K Debattista, S Zhuang, J Duan, Z Li, J Shenton, ... Philosophical Transactions A 382 (2284), 20230244, 2024 | 1 | 2024 |
CapHDR2IR: Caption-Driven Transfer from Visible Light to Infrared Domain J Peng, T Bashford-Rogers, Z Shao, H Zhao, AR Singh, A Goswami, ... arXiv preprint arXiv:2411.16327, 2024 | | 2024 |
Tutorial: Large Language-Vision Model in Society K Yu, Z Shao, S Qi, D Liu Proceedings of the 32nd ACM International Conference on Multimedia, 11298-11299, 2024 | | 2024 |