MMMU: A massive multi-discipline multimodal understanding and reasoning benchmark for expert AGI | X Yue, Y Ni, K Zhang, T Zheng, R Liu, G Zhang, S Stevens, D Jiang, ... | Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 134 | 2024 |
ConsistI2V: Enhancing visual consistency for image-to-video generation | W Ren, H Yang, G Zhang, C Wei, X Du, S Huang, W Chen | arXiv preprint arXiv:2402.04324, 2024 | 10 | 2024 |
AnyV2V: A plug-and-play framework for any video-to-video editing tasks | M Ku, C Wei, W Ren, H Yang, W Chen | arXiv preprint arXiv:2403.14468, 2024 | 7 | 2024 |
HiCu: Leveraging hierarchy for curriculum learning in automated ICD coding | W Ren, R Zeng, T Wu, T Zhu, RG Krishnan | Machine Learning for Healthcare Conference, 198-223, 2022 | 5 | 2022 |
MMLU-Pro: A more robust and challenging multi-task language understanding benchmark | Y Wang, X Ma, G Zhang, Y Ni, A Chandra, S Guo, W Ren, A Arulraj, X He, ... | arXiv preprint arXiv:2406.01574, 2024 | 4 | 2024 |
Towards transformer-based automated ICD coding: Challenges, pitfalls and solutions | W Ren, T Zhu, R Zeng, T Wu | | 2 | 2021 |
StructLM: Towards Building Generalist Models for Structured Knowledge Grounding | A Zhuang, G Zhang, T Zheng, X Du, J Wang, W Ren, SW Huang, J Fu, ... | arXiv preprint arXiv:2402.16671, 2024 | 1 | 2024 |
Video Diffusion Models: A Survey | A Melnik, M Ljubljanac, C Lu, Q Yan, W Ren, H Ritter | arXiv preprint arXiv:2405.03150, 2024 | | 2024 |