Prompt-aligned Gradient for Prompt Tuning B Zhu, Y Niu, Y Han, Y Wu, H Zhang arXiv preprint arXiv:2205.14865, 2022 | 155 | 2022 |
AppAgent: Multimodal Agents as Smartphone Users Z Yang, J Liu, Y Han, X Chen, Z Huang, B Fu, G Yu arXiv preprint arXiv:2312.13771, 2023 | 44 | 2023 |
Learning multiscale hierarchical attention for video summarization W Zhu, J Lu, Y Han, J Zhou Pattern Recognition 122, 108312, 2022 | 42 | 2022 |
Relational Reasoning Over Spatial-Temporal Graphs for Video Summarization W Zhu, Y Han, J Lu, J Zhou IEEE Transactions on Image Processing 31, 3017-3031, 2022 | 33 | 2022 |
ChartLlama: A Multimodal LLM for Chart Understanding and Generation Y Han, C Zhang, X Chen, X Yang, Z Wang, G Yu, B Fu, H Zhang arXiv preprint arXiv:2311.16483, 2023 | 20 | 2023 |
Fast AdvProp J Mei, Y Han, Y Bai, Y Zhang, Y Li, X Li, A Yuille, C Xie International Conference on Learning Representations, 2021 | 10 | 2021 |
Robust Process Identification from Step Response Data and Parallel Implementation Y Han, Q Liu, C Shang, D Huang 2021 3rd International Conference on Industrial Artificial Intelligence (IAI …, 2021 | 1 | 2021 |
EMMA: Your Text-to-Image Diffusion Model Can Secretly Accept Multi-Modal Prompts Y Han, R Wang, C Zhang, J Hu, P Cheng, B Fu, H Zhang arXiv preprint arXiv:2406.09162, 2024 | | 2024 |
Dual-Perspective Knowledge Enrichment for Semi-Supervised 3D Object Detection Y Han, N Zhao, W Chen, KT Ma, H Zhang arXiv preprint arXiv:2401.05011, 2024 | | 2024 |