Pseudo Numerical Methods for Diffusion Models on Manifolds L Liu, Y Ren, Z Lin, Z Zhao arXiv preprint arXiv:2202.09778, 2022 | 417 | 2022 |
Cross-modal interaction networks for query-based moment retrieval in videos Z Zhang, Z Lin, Z Zhao, Z Xiao Proceedings of the 42nd International ACM SIGIR Conference on Research and …, 2019 | 222 | 2019 |
Weakly-Supervised Video Moment Retrieval via Semantic Completion Network Z Lin, Z Zhao, Z Zhang, Q Wang, H Liu Proceedings of the AAAI Conference on Artificial Intelligence 34, 11539-11546, 2020 | 142 | 2020 |
Counterfactual Contrastive Learning for Weakly-Supervised Vision-Language Grounding Z Zhang, Z Zhao, Z Lin, X He Advances in Neural Information Processing Systems 33, 18123-18134, 2020 | 109 | 2020 |
Cascaded Prediction Network via Segment Tree for Temporal Video Grounding Y Zhao, Z Zhao, Z Zhang, Z Lin Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 74 | 2021 |
BuboGPT: Enabling Visual Grounding in Multi-Modal LLMs Y Zhao, Z Lin, D Zhou, Z Huang, J Feng, B Kang arXiv preprint arXiv:2307.08581, 2023 | 62 | 2023 |
Regularized two-branch proposal networks for weakly-supervised moment retrieval in videos Z Zhang, Z Lin, Z Zhao, J Zhu, X He Proceedings of the 28th ACM International Conference on Multimedia, 4098-4106, 2020 | 62 | 2020 |
Moment Retrieval via Cross-Modal Interaction Networks With Query Reconstruction Z Lin, Z Zhao, Z Zhang, Z Zhang, D Cai IEEE Transactions on Image Processing 29, 3750-3762, 2020 | 57 | 2020 |
Unsupervised representation learning from pre-trained diffusion probabilistic models Z Zhang, Z Zhao, Z Lin Advances in Neural Information Processing Systems 35, 22117-22130, 2022 | 38 | 2022 |
Temporal textual localization in video via adversarial bi-directional interaction networks Z Zhang, Z Zhao, Z Zhang, Z Lin, Q Wang, R Hong IEEE Transactions on Multimedia 23, 3306-3317, 2020 | 26 | 2020 |
Object-Aware Multi-Branch Relation Networks for Spatio-Temporal Video Grounding Z Zhang, Z Zhao, Z Lin, B Huai, NJ Yuan arXiv preprint arXiv:2008.06941, 2020 | 24 | 2020 |
EditAnything: Empowering Unparalleled Flexibility in Image Editing and Generation S Gao, Z Lin, X Xie, P Zhou, MM Cheng, S Yan Proceedings of the 31st ACM International Conference on Multimedia, 9414-9416, 2023 | 18 | 2023 |
PLLaVA: Parameter-free LLaVA Extension from Images to Videos for Video Dense Captioning L Xu, Y Zhao, D Zhou, Z Lin, SK Ng, J Feng arXiv preprint arXiv:2404.16994, 2024 | 17 | 2024 |
SimulLR: Simultaneous Lip Reading Transducer with Attention-Guided Adaptive Memory Z Lin, Z Zhao, H Li, J Liu, M Zhang, X Zeng, X He Proceedings of the 29th ACM International Conference on Multimedia, 1359-1367, 2021 | 16 | 2021 |
Open-Ended Long-Form Video Question Answering via Hierarchical Convolutional Self-Attention Networks Z Zhang, Z Zhao, Z Lin, J Song, X He arXiv preprint arXiv:1906.12158, 2019 | 14 | 2019 |
Localizing Unseen Activities in Video via Image Query Z Zhang, Z Zhao, Z Lin, J Song, D Cai arXiv preprint arXiv:1906.12165, 2019 | 14 | 2019 |
Learning to Rehearse in Long Sequence Memorization Z Zhang, C Zhou, J Ma, Z Lin, J Zhou, H Yang, Z Zhao International Conference on Machine Learning, 12663-12673, 2021 | 11 | 2021 |
Towards Garment Sewing Pattern Reconstruction from a Single Image L Liu, X Xu, Z Lin, J Liang, S Yan ACM Transactions on Graphics (TOG) 42 (6), 1-15, 2023 | 8 | 2023 |
DATE: Domain Adaptive Product Seeker for E-commerce H Li, H Jiang, T Jin, M Li, Y Chen, Z Lin, Y Zhao, Z Zhao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 5 | 2023 |
MagicVideo-V2: Multi-Stage High-Aesthetic Video Generation W Wang, J Liu, Z Lin, J Yan, S Chen, C Low, T Hoang, J Wu, JH Liew, ... arXiv preprint arXiv:2401.04468, 2024 | 4 | 2024 |