Junyang Wang
Verified email at bjtu.edu.cn
Title
Cited by
Year
mPLUG-Owl: Modularization Empowers Large Language Models with Multimodality
Q Ye, H Xu, G Xu, J Ye, M Yan, Y Zhou, J Wang, A Hu, P Shi, Y Shi, C Li, ...
arXiv preprint arXiv:2304.14178, 2023
Cited by 543, 2023
Evaluation and Analysis of Hallucination in Large Vision-Language Models
J Wang, Y Zhou, G Xu, P Shi, C Zhao, H Xu, Q Ye, M Yan, J Zhang, J Zhu, ...
arXiv preprint arXiv:2308.15126, 2023
Cited by 56, 2023
AMBER: An LLM-free Multi-dimensional Benchmark for MLLMs Hallucination Evaluation
J Wang, Y Wang, G Xu, J Zhang, Y Gu, H Jia, J Wang, H Xu, M Yan, ...
arXiv preprint arXiv:2311.07397, 2023
Cited by 36, 2023
Mobile-Agent: Autonomous Multi-modal Mobile Device Agent with Visual Perception
J Wang, H Xu, J Ye, M Yan, W Shen, J Zhang, F Huang, J Sang
ICLR 2024 Workshop on Large Language Model (LLM) Agents, 2024
Cited by 22, 2024
FairCLIP: Social Bias Elimination Based on Attribute Prototype Learning and Representation Neutralization
J Wang, Y Zhang, J Sang
arXiv preprint arXiv:2210.14562, 2022
Cited by 17, 2022
Counterfactually Measuring and Eliminating Social Bias in Vision-Language Pre-training Models
Y Zhang, J Wang, J Sang
Proceedings of the 30th ACM International Conference on Multimedia, 4996-5004, 2022
Cited by 16, 2022
Zero-shot Image Captioning by Anchor-augmented Vision-language Space Alignment
J Wang, Y Zhang, M Yan, J Zhang, J Sang
arXiv preprint arXiv:2211.07275, 2022
Cited by 11, 2022
Improved Visual Fine-tuning with Natural Language Supervision
J Wang, Y Xu, J Hu, M Yan, J Sang, Q Qian
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by 5, 2023
From Association to Generation: Text-only Captioning by Unsupervised Cross-modal Mapping
J Wang, M Yan, Y Zhang, J Sang
Proceedings of the Thirty-Second International Joint Conference on …, 2023
Cited by 4, 2023
Benign Shortcut for Debiasing: Fair Visual Recognition via Intervention with Shortcut Features
Y Zhang, J Sang, J Wang, D Jiang, Y Wang
Proceedings of the 31st ACM International Conference on Multimedia, 8860-8868, 2023
Cited by 3, 2023
Mobile-Agent-v2: Mobile Device Operation Assistant with Effective Navigation via Multi-Agent Collaboration
J Wang, H Xu, H Jia, X Zhang, M Yan, W Shen, J Zhang, F Huang, J Sang
arXiv preprint arXiv:2406.01014, 2024
Cited by 1, 2024
mPLUG-Octopus: The Versatile Assistant Empowered by A Modularized End-to-End Multimodal LLM
Q Ye, H Xu, M Yan, C Zhao, J Wang, X Yang, J Zhang, F Huang, J Sang, ...
Proceedings of the 31st ACM International Conference on Multimedia, 9365-9367, 2023
Cited by 1, 2023
Overlap Bias Matching is Necessary for Point Cloud Registration
P Shi, J Zhang, H Cheng, J Wang, Y Zhou, C Zhao, J Zhu
IEEE Robotics and Automation Letters, 2023
Cited by 1, 2023
Fair Visual Recognition via Intervention with Proxy Features
Y Zhang, J Sang, J Wang
arXiv preprint arXiv:2211.01253, 2022
Cited by 1, 2022