Maosong Sun
Professor of Computer Science and Technology, Tsinghua University
Verified email at tsinghua.edu.cn
Title
Cited by
Year
Scaling Large-Language-Model-based Multi-Agent Collaboration
C Qian, Z Xie, Y Wang, W Liu, Y Dang, Z Du, W Chen, C Yang, Z Liu, ...
arXiv preprint arXiv:2406.07155, 2024
2024
Seq1F1B: Efficient Sequence-Level Pipeline Parallelism for Large Language Model Training
S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun
arXiv preprint arXiv:2406.03488, 2024
2024
ULTRAFEEDBACK: Boosting Language Models with Scaled AI Feedback
G Cui, L Yuan, N Ding, G Yao, B He, W Zhu, Y Ni, G Xie, R Xie, Y Lin, ...
Forty-first International Conference on Machine Learning, 2024
2024
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models
B Ping, S Wang, H Wang, X Han, Y Xu, Y Yan, Y Chen, B Chang, Z Liu, ...
arXiv preprint arXiv:2406.08903, 2024
2024
Hyperbolic Pre-Trained Language Model
W Chen, X Han, Y Lin, K He, R Xie, J Zhou, Z Liu, M Sun
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2024
2024
RLAIF-V: Aligning MLLMs through Open-Source AI Feedback for Super GPT-4V Trustworthiness
T Yu, H Zhang, Y Yao, Y Dang, D Chen, X Lu, G Cui, T He, Z Liu, TS Chua, ...
arXiv preprint arXiv:2405.17220, 2024
Cited by 2
2024
Personality-affected Emotion Generation in Dialog Systems
Z Wen, J Cao, J Shen, R Yang, S Liu, M Sun
ACM Transactions on Information Systems 42 (5), 1-27, 2024
2024
Iterative Experience Refinement of Software-Developing Agents
C Qian, J Li, Y Dang, W Liu, YF Wang, Z Xie, W Chen, C Yang, Y Zhang, ...
arXiv preprint arXiv:2405.04219, 2024
Cited by 1
2024
Fine-Grained Legal Argument-Pair Extraction via Coarse-Grained Pre-training
C Xiao, Y Sun, Y Yao, X Han, W Zhang, Z Liu, M Sun
Proceedings of the 2024 Joint International Conference on Computational …, 2024
2024
LEGENT: Open Platform for Embodied Agents
Z Cheng, Z Wang, J Hu, S Hu, A Liu, Y Tu, P Li, L Shi, Z Liu, M Sun
arXiv preprint arXiv:2404.18243, 2024
2024
Beyond Human Norms: Unveiling Unique Values of Large Language Models through Interdisciplinary Approaches
P Biedma, X Yi, L Huang, M Sun, X Xie
arXiv preprint arXiv:2404.12744, 2024
2024
UltraEval: A Lightweight Platform for Flexible and Comprehensive Evaluation for LLMs
C He, R Luo, S Hu, Y Zhao, J Zhou, H Wu, J Zhang, X Han, Z Liu, M Sun
arXiv preprint arXiv:2404.07584, 2024
Cited by 1
2024
MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies
S Hu, Y Tu, X Han, C He, G Cui, X Long, Z Zheng, Y Fang, Y Huang, ...
arXiv preprint arXiv:2404.06395, 2024
Cited by 9
2024
Advancing LLM Reasoning Generalists with Preference Trees
L Yuan, G Cui, H Wang, N Ding, X Wang, J Deng, B Shan, H Chen, R Xie, ...
arXiv preprint arXiv:2404.02078, 2024
Cited by 8
2024
Robust and Scalable Model Editing for Large Language Models
Y Chen, Z Zhang, X Han, C Xiao, Z Liu, C Chen, K Li, T Yang, M Sun
arXiv preprint arXiv:2403.17431, 2024
Cited by 1
2024
LLaVA-UHD: An LMM Perceiving Any Aspect Ratio and High-Resolution Images
R Xu, Y Yao, Z Guo, J Cui, Z Ni, C Ge, TS Chua, Z Liu, M Sun, G Huang
arXiv preprint arXiv:2403.11703, 2024
Cited by 8
2024
BurstAttention: An Efficient Distributed Attention Framework for Extremely Long Sequences
S Ao, W Zhao, X Han, C Yang, Z Liu, C Shi, M Sun, S Wang, T Su
arXiv preprint arXiv:2403.09347, 2024
Cited by 1
2024
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models
N Ding, Y Chen, G Cui, X Lv, R Xie, B Zhou, Z Liu, M Sun
arXiv preprint arXiv:2403.08281, 2024
Cited by 1
2024
StableToolBench: Towards Stable Large-Scale Benchmarking on Tool Learning of Large Language Models
Z Guo, S Cheng, H Wang, S Liang, Y Qin, P Li, Z Liu, M Sun, Y Liu
arXiv preprint arXiv:2403.07714, 2024
Cited by 1
2024
On the Essence and Prospect: An Investigation of Alignment Approaches for Big Models
X Wang, S Duan, X Yi, J Yao, S Zhou, Z Wei, P Zhang, D Xu, M Sun, X Xie
arXiv preprint arXiv:2403.04204, 2024
Cited by 1
2024