DIFNet: Boosting visual information flow for image captioning M Wu, X Zhang, X Sun, Y Zhou, C Chen, J Gu, X Sun, R Ji Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022 | 52 | 2022 |
End-to-end zero-shot HOI detection via vision and language knowledge distillation M Wu, J Gu, Y Shen, M Lin, C Chen, X Sun Proceedings of the AAAI Conference on Artificial Intelligence 37 (3), 2839-2846, 2023 | 37 | 2023 |
Evaluating and Analyzing Relationship Hallucinations in Large Vision-Language Models M Wu, J Ji, O Huang, J Li, Y Wu, X Sun, R Ji Proceedings of the 41st International Conference on Machine Learning, 53553 …, 2024 | 13* | 2024 |
ControlMLLM: Training-free visual prompt learning for multimodal large language models M Wu, X Cai, J Ji, J Li, O Huang, G Luo, H Fei, X Sun, R Ji NeurIPS 2024, 2024 | 8 | 2024 |
TraDiffusion: Trajectory-based training-free image generation M Wu, O Huang, J Ji, J Li, X Cai, H Kuang, J Liu, X Sun, R Ji arXiv preprint arXiv:2408.09739, 2024 | 2 | 2024 |
Toward Open-Set Human Object Interaction Detection M Wu, Y Liu, J Ji, X Sun, R Ji Proceedings of the AAAI Conference on Artificial Intelligence 38 (6), 6066-6073, 2024 | 2 | 2024 |
TraDiffusion++: Hierarchical Guidance for Fine-Grained Trajectory-Based Image Generation O Huang, M Wu, X Sun, J Ji, J Li, R Dong, R Ji | | |