Scaling Large-Language-Model-based Multi-Agent Collaboration C Qian, Z Xie, Y Wang, W Liu, Y Dang, Z Du, W Chen, C Yang, Z Liu, ... arXiv preprint arXiv:2406.07155, 2024 | | 2024 |
Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun arXiv preprint arXiv:2406.03488, 2024 | | 2024 |
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback G Cui, L Yuan, N Ding, G Yao, B He, W Zhu, Y Ni, G Xie, R Xie, Y Lin, ... Forty-first International Conference on Machine Learning, 2024 | | 2024 |
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models B Ping, S Wang, H Wang, X Han, Y Xu, Y Yan, Y Chen, B Chang, Z Liu, ... arXiv e-prints, arXiv: 2406.08903, 2024 | | 2024 |
Hyperbolic Pre-Trained Language Model W Chen, X Han, Y Lin, K He, R Xie, J Zhou, Z Liu, M Sun IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024 | | 2024 |
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness T Yu, H Zhang, Y Yao, Y Dang, D Chen, X Lu, G Cui, T He, Z Liu, TS Chua, ... arXiv preprint arXiv:2405.17220, 2024 | 2 | 2024 |
Personality-affected Emotion Generation in Dialog Systems Z Wen, J Cao, J Shen, R Yang, S Liu, M Sun ACM Transactions on Information Systems 42 (5), 1-27, 2024 | | 2024 |
Iterative Experience Refinement of Software-Developing Agents C Qian, J Li, Y Dang, W Liu, YF Wang, Z Xie, W Chen, C Yang, Y Zhang, ... arXiv preprint arXiv:2405.04219, 2024 | 1 | 2024 |
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training C Xiao, Y Sun, Y Yao, X Han, W Zhang, Z Liu, M Sun Proceedings of the 2024 Joint International Conference on Computational …, 2024 | | 2024 |
LEGENT: Open Platform for Embodied Agents Z Cheng, Z Wang, J Hu, S Hu, A Liu, Y Tu, P Li, L Shi, Z Liu, M Sun arXiv preprint arXiv:2404.18243, 2024 | | 2024 |
Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches P Biedma, X Yi, L Huang, M Sun, X Xie arXiv preprint arXiv:2404.12744, 2024 | | 2024 |
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs C He, R Luo, S Hu, Y Zhao, J Zhou, H Wu, J Zhang, X Han, Z Liu, M Sun arXiv preprint arXiv:2404.07584, 2024 | 1 | 2024 |
Minicpm: Unveiling the potential of small language models with scalable training strategies S Hu, Y Tu, X Han, C He, G Cui, X Long, Z Zheng, Y Fang, Y Huang, ... arXiv preprint arXiv:2404.06395, 2024 | 9 | 2024 |
Advancing llm reasoning generalists with preference trees L Yuan, G Cui, H Wang, N Ding, X Wang, J Deng, B Shan, H Chen, R Xie, ... arXiv preprint arXiv:2404.02078, 2024 | 8 | 2024 |
Robust and scalable model editing for large language models Y Chen, Z Zhang, X Han, C Xiao, Z Liu, C Chen, K Li, T Yang, M Sun arXiv preprint arXiv:2403.17431, 2024 | 1 | 2024 |
Llava-uhd: an lmm perceiving any aspect ratio and high-resolution images R Xu, Y Yao, Z Guo, J Cui, Z Ni, C Ge, TS Chua, Z Liu, M Sun, G Huang arXiv preprint arXiv:2403.11703, 2024 | 8 | 2024 |
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun, S Wang, T Su arXiv preprint arXiv:2403.09347, 2024 | 1 | 2024 |
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models N Ding, Y Chen, G Cui, X Lv, R Xie, B Zhou, Z Liu, M Sun arXiv preprint arXiv:2403.08281, 2024 | 1 | 2024 |
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models Z Guo, S Cheng, H Wang, S Liang, Y Qin, P Li, Z Liu, M Sun, Y Liu arXiv preprint arXiv:2403.07714, 2024 | 1 | 2024 |
On the essence and prospect: An investigation of alignment approaches for big models X Wang, S Duan, X Yi, J Yao, S Zhou, Z Wei, P Zhang, D Xu, M Sun, X Xie arXiv preprint arXiv:2403.04204, 2024 | 1 | 2024 |