Hallucination augmented contrastive learning for multimodal large language model | C Jiang, H Xu, M Dong, J Chen, W Ye, M Yan, Q Ye, J Zhang, F Huang, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 24 | 2024 |
MaVEn: An Effective Multi-granularity Hybrid Visual Encoding Framework for Multimodal Large Language Model | C Jiang, J Hongrui, H Xu, W Ye, M Dong, M Yan, J Zhang, F Huang, ... | arXiv preprint arXiv:2408.12321, 2024 | | 2024 |