Rlhf-v: Towards trustworthy mllms via behavior alignment from fine-grained correctional human feedback T Yu, Y Yao, H Zhang, T He, Y Han, G Cui, J Hu, Z Liu, HT Zheng, M Sun, ... CVPR 2024, 2023 | 39 | 2023 |
Towards interpretable natural language understanding with explanations as latent variables W Zhou*, J Hu*, H Zhang*, X Liang, M Sun, C Xiong, J Tang Advances in Neural Information Processing Systems 33, 6803-6814, 2020 | 32 | 2020 |
Fuse It More Deeply! A Variational Transformer with Layer-Wise Latent Variable Inference for Text Generation J Hu, X Yi, W Li, M Sun, X Xie NAACL 2022, 2022 | 20 | 2022 |
Large multilingual models pivot zero-shot multimodal learning across languages J Hu, Y Yao, C Wang, S Wang, Y Pan, Q Chen, T Yu, H Wu, Y Zhao, ... ICLR 2024 (spotlight), 2023 | 16 | 2023 |
Generating major types of chinese classical poetry in a uniformed framework J Hu, M Sun arXiv preprint arXiv:2003.11528, 2020 | 15 | 2020 |
Reformulating Vision-Language Foundation Models and Datasets Towards Universal Multimodal Assistants T Yu*, J Hu*, Y Yao, H Zhang, Y Zhao, C Wang, S Wang, Y Pan, J Xue, ... arXiv preprint arXiv:2310.00653, 2023 | 9 | 2023 |
Aspect-level sentiment-controllable review generation with mutual learning framework H Chen, Y Lin, F Qi, J Hu, P Li, J Zhou, M Sun Proceedings of the AAAI conference on artificial intelligence 35 (14), 12639 …, 2021 | 8 | 2021 |
OlympiadBench: A Challenging Benchmark for Promoting AGI with Olympiad-Level Bilingual Multimodal Scientific Problems C He, R Luo, Y Bai, S Hu, ZL Thai, J Shen, J Hu, X Han, Y Huang, ... ACL 2024, 2024 | 6 | 2024 |
Efficient cross-lingual transfer for chinese stable diffusion with images as pivots J Hu, X Han, X Yi, Y Chen, W Li, Z Liu, M Sun arXiv preprint arXiv:2305.11540, 2023 | 4 | 2023 |
Revisiting non-autoregressive transformers for efficient image synthesis Z Ni, Y Wang, R Zhou, J Guo, J Hu, Z Liu, S Song, Y Yao, G Huang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 3 | 2024 |
Exploring Perceptual Limitation of Multimodal Large Language Models J Zhang, J Hu, M Khayatkhoei, F Ilievski, M Sun arXiv preprint arXiv:2402.07384, 2024 | 2 | 2024 |
Recurrence Boosts Diversity! Revisiting Recurrent Latent Variable in Transformer-Based Variational AutoEncoder for Diverse Text Generation J Hu, X Yi, W Li, M Sun, X Xie Findings of EMNLP 2022, 2022 | 1 | 2022 |
GUICourse: From General Vision Language Models to Versatile GUI Agents W Chen, J Cui, J Hu, Y Qin, J Fang, Y Zhao, C Wang, J Liu, G Chen, ... arXiv preprint arXiv:2406.11317, 2024 | | 2024 |
scMulan: a multitask generative pre-trained language model for single-cell analysis H Bian, Y Chen, X Dong, C Li, M Hao, S Chen, J Hu, M Sun, L Wei, ... International Conference on Research in Computational Molecular Biology, 479-482, 2024 | | 2024 |
LEGENT: Open Platform for Embodied Agents Z Cheng, Z Wang, J Hu, S Hu, A Liu, Y Tu, P Li, L Shi, Z Liu, M Sun arXiv preprint arXiv:2404.18243, 2024 | | 2024 |
Evade the Trap of Mediocrity: Promoting Diversity and Novelty in Text Generation via Concentrating Attention W Li, X Yi, J Hu, M Sun, X Xie arXiv preprint arXiv:2211.07164, 2022 | | 2022 |