RoleLLM: Benchmarking, Eliciting, and Enhancing Role-Playing Abilities of Large Language Models ZM Wang, Z Peng, H Que, J Liu, W Zhou, Y Wu, H Guo, R Gan, Z Ni, ... ACL 2024 (Findings) (posted by Aran Komatsuzaki), 2023 | 60 | 2023 |
Interactive Natural Language Processing Z Wang, G Zhang, K Yang, N Shi, W Zhou, S Hao, G Xiong, Y Li, MY Sim, ... Springer Nature (posted by 机器之心), 2023 | 38 | 2023 |
Chinese open instruction generalist: A preliminary release G Zhang, Y Shi, R Liu, R Yuan, Y Li, S Dong, Y Shu, Z Li, Z Wang, C Lin, ... Instruction Data (posted by AK), 2023 | 20 | 2023 |
Align on the Fly: Adapting Chatbot Behavior to Established Norms C Xu, S Chern, E Chern, G Zhang, Z Wang, R Liu, J Li, J Fu, P Liu arXiv (posted by 机器之心), 2023 | 5 | 2023 |
MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series G Zhang, S Qu, J Liu, C Zhang, C Lin, CL Yu, D Pan, E Cheng, J Liu, ... Foundation Model (posted by AK), 2024 | 3 | 2024 |
COIG-CQIA: Quality is All You Need for Chinese Instruction Fine-tuning Y Bai, X Du, Y Liang, Y Jin, Z Liu, J Zhou, T Zheng, X Zhang, N Ma, ... arXiv (posted by 量子位, 机器之心), 2024 | 2* | 2024 |
CIF-Bench: A Chinese Instruction-Following Benchmark for Evaluating the Generalizability of Large Language Models Y Li, G Zhang, X Qu, J Li, Z Li, Z Wang, H Li, R Yuan, Y Ma, K Zhang, ... ACL 2024 (Findings), 2024 | 1 | 2024 |
LLM Agents for Psychology: A Study on Gamified Assessments Q Yang*, Z Wang*, H Chen, S Wang, Y Pu, X Gao, W Huang, S Song, ... ACL 2024 (Main) (*equal contribution, ordered alphabetically) (posted by 量子位), 2024 | 1 | 2024 |
CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark G Zhang, X Du, B Chen, Y Liang, T Luo, T Zheng, K Zhu, Y Cheng, C Xu, ... arXiv (posted by AK), 2024 | 1 | 2024 |
PIN: A Knowledge-Intensive Dataset for Paired and Interleaved Multimodal Documents J Wang, Y Zhang, Y Ji, Y Zhang, C Jiang, Y Wang, K Zhu, Z Wang, ... arXiv preprint arXiv:2406.13923, 2024 | | 2024 |
McEval: Massively Multilingual Code Evaluation L Chai, S Liu, J Yang, Y Yin, K Jin, J Liu, T Sun, G Zhang, C Ren, H Guo, ... arXiv preprint arXiv:2406.07436, 2024 | | 2024 |
II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models Z Liu, F Fang, X Feng, X Du, C Zhang, Z Wang, Y Bai, Q Zhao, L Fan, ... arXiv preprint arXiv:2406.05862, 2024 | | 2024 |