Zhuohan Li 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	5517	5508
h 指数	16	16
i10 指数	17	17

3500

1750

875

2625

20192020202120222023202428 125 172 244 1485 3443

开放获取的出版物数量

查看全部

7 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Ion StoicaProfessor of Computer Science, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Hao ZhangUC San Diego在 ucsd.edu 的电子邮件经过验证
Siyuan ZhuangPhD Student, UC Berkeley在 berkeley.edu 的电子邮件经过验证
Joseph E. GonzalezProfessor of Computer Science, UC Berkeley在 berkeley.edu 的电子邮件经过验证
Zi LinUC San Diego在 ucsd.edu 的电子邮件经过验证
Di HePeking University在 pku.edu.cn 的电子邮件经过验证
Danyang ZhuoDuke University在 duke.edu 的电子邮件经过验证
Tao QinSenior Principal Research Manager, Microsoft Research在 microsoft.com 的电子邮件经过验证
Liwei WangProfessor, Peking University在 cis.pku.edu.cn 的电子邮件经过验证
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA Fellow在 microsoft.com 的电子邮件经过验证
Zhifeng ChenGoogle Inc.在 google.com 的电子邮件经过验证
Zhiqing SunOpenAI在 openai.com 的电子邮件经过验证
Kevin LinUC Berkeley在 berkeley.edu 的电子邮件经过验证
Sheng ShenUC Berkeley在 berkeley.edu 的电子邮件经过验证
Eric WallaceUC Berkeley在 berkeley.edu 的电子邮件经过验证
Kurt KeutzerProfessor of the Graduate School, EECS, University of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Yuanzhong XuGoogle DeepMind在 utexas.edu 的电子邮件经过验证
Linyuan GongUC Berkeley在 berkeley.edu 的电子邮件经过验证
Dawn SongProfessor of Computer Science, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Stephanie WangPhD student, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证

关注

Zhuohan Li

UC Berkeley

在 berkeley.edu 的电子邮件经过验证 - 首页

Machine Learning Distributed Systems


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality WL Chiang, Z Li, Z Lin, Y Sheng, Z Wu, H Zhang, L Zheng, S Zhuang, ... See https://vicuna. lmsys. org (accessed 14 April 2023) 2 (3), 6, 2023	1677*	2023
Judging llm-as-a-judge with mt-bench and chatbot arena L Zheng, WL Chiang, Y Sheng, S Zhuang, Z Wu, Y Zhuang, Z Lin, Z Li, ... Advances in Neural Information Processing Systems 36, 2024	1649*	2024
Efficient memory management for large language model serving with pagedattention W Kwon, Z Li, S Zhuang, Y Sheng, L Zheng, CH Yu, J Gonzalez, H Zhang, ... Proceedings of the 29th Symposium on Operating Systems Principles, 611-626, 2023	591	2023
Train big, then compress: Rethinking model size for efficient training and inference of transformers Z Li, E Wallace, S Shen, K Lin, K Keutzer, D Klein, J Gonzalez International Conference on Machine Learning, 5958-5968, 2020	279	2020
Alpa: Automating Inter- and Intra-Operator Parallelism for Distributed Deep Learning L Zheng, Z Li, H Zhang, Y Zhuang, Z Chen, Y Huang, Y Wang, Y Xu, ... arXiv preprint arXiv:2201.12023, 2022	256	2022
Flexgen: High-throughput generative inference of large language models with a single gpu Y Sheng, L Zheng, B Yuan, Z Li, M Ryabinin, B Chen, P Liang, C Ré, ... International Conference on Machine Learning, 31094-31116, 2023	195	2023
Understanding and improving transformer from a multi-particle dynamic system point of view Y Lu, Z Li, D He, Z Sun, B Dong, T Qin, L Wang, TY Liu arXiv preprint arXiv:1906.02762, 2019	185	2019
Efficient training of bert by progressively stacking L Gong, D He, Z Li, T Qin, L Wang, T Liu International conference on machine learning, 2337-2346, 2019	141	2019
Fast structured decoding for sequence models Z Sun, Z Li, H Wang, D He, Z Lin, Z Deng Advances in Neural Information Processing Systems 32, 2019	117	2019
Terapipe: Token-level pipeline parallelism for training large-scale language models Z Li, S Zhuang, S Guo, D Zhuo, H Zhang, D Song, I Stoica International Conference on Machine Learning, 6543-6552, 2021	87	2021
{AlpaServe}: Statistical multiplexing with model parallelism for deep learning serving Z Li, L Zheng, Y Zhong, V Liu, Y Sheng, X Jin, Y Huang, Z Chen, H Zhang, ... 17th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2023	86	2023
Hint-based training for non-autoregressive machine translation Z Li, Z Lin, D He, F Tian, T Qin, L Wang, TY Liu	77	2018
Towards binary-valued gates for robust lstm training Z Li, D He, F Tian, W Chen, T Qin, L Wang, T Liu International Conference on Machine Learning, 2995-3004, 2018	59	2018
Lmsys-chat-1m: A large-scale real-world llm conversation dataset L Zheng, WL Chiang, Y Sheng, T Li, S Zhuang, Z Wu, Y Zhuang, Z Li, ... arXiv preprint arXiv:2309.11998, 2023	54	2023
Hoplite: efficient and fault-tolerant collective communication for task-based distributed systems S Zhuang, Z Li, D Zhuo, S Wang, E Liang, R Nishihara, P Moritz, I Stoica Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 641-656, 2021	26	2021
On optimizing the communication of model parallelism Y Zhuang, L Zheng, Z Li, E Xing, Q Ho, J Gonzalez, I Stoica, H Zhang, ... Proceedings of Machine Learning and Systems 5, 2023	21	2023
Fairness in serving large language models Y Sheng, S Cao, D Li, B Zhu, Z Li, D Zhuo, JE Gonzalez, I Stoica 18th USENIX Symposium on Operating Systems Design and Implementation (OSDI …, 2024	14	2024
Rearchitecting in-memory object stores for low latency D Zhuo, K Zhang, Z Li, S Zhuang, S Wang, A Chen, I Stoica Proceedings of the VLDB Endowment, 555-568, 2021	3	2021
Optimizing Speculative Decoding for Serving Large Language Models Using Goodput X Liu, C Daniel, L Hu, W Kwon, Z Li, X Mo, A Cheung, Z Deng, I Stoica, ... arXiv preprint arXiv:2406.14066, 2024		2024
Simple and Automatic Distributed Machine Learning on Ray H Zhang, Z Li, L Zheng, I Stoica Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data …, 2021		2021

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用