Banghua Zhu: Academic Profile - Academic Resource Search

Cited by

	All	Since 2019
Citations	1187	1185
h-index	16	16
i10-index	20	20

[Bar chart: citations per year] 2019: 22, 2020: 55, 2021: 100, 2022: 198, 2023: 350, 2024: 448

Public access

15 articles available
1 article not available

Based on funding mandates

Co-authors

Jiantao Jiao: Assistant Professor of EECS and Statistics, University of California, Berkeley. Verified email at berkeley.edu
Michael I. Jordan: Professor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley. Verified email at cs.berkeley.edu
Cong Ma: University of Chicago. Verified email at uchicago.edu
Stuart Russell: Professor of Computer Science, University of California, Berkeley. Verified email at cs.berkeley.edu
Ying Sheng: PhD student of Stanford University. Verified email at stanford.edu
Lianmin Zheng: UC Berkeley. Verified email at berkeley.edu
Ion Stoica: Professor of Computer Science, UC Berkeley. Verified email at cs.berkeley.edu
Paria Rashidinejad: Postdoctoral Scholar, University of California, Berkeley. Verified email at berkeley.edu
Joseph E. Gonzalez: Professor of Computer Science, UC Berkeley. Verified email at berkeley.edu
Jacob Steinhardt: Stanford University. Verified email at cs.stanford.edu
Dacheng Li: UC Berkeley. Verified email at berkeley.edu
Hanlin Zhu: Ph.D. student, University of California, Berkeley. Verified email at berkeley.edu
Kurt Keutzer: Professor of the Graduate School, EECS, University of California, Berkeley. Verified email at berkeley.edu
Song Jian: Tsinghua University. Verified email at tsinghua.edu.cn
Tianhao Wu: University of California, Berkeley. Verified email at berkeley.edu
Ikechukwu Uchendu: Harvard University. Verified email at g.harvard.edu
Lele Wang: University of British Columbia. Verified email at ece.ubc.ca
Nadim Ghaddar: Postdoctoral Fellow, University of Toronto. Verified email at utoronto.ca
Shiyi Cao: UC Berkeley. Verified email at berkeley.edu
Ziao Wang: PhD student in Electrical and Computer Engineering, University of British Columbia. Verified email at ece.ubc.ca


Banghua Zhu

University of California, Berkeley

Verified email at berkeley.edu - Homepage

foundation models, human-AI interaction, statistics, information theory, reinforcement learning


Title	Cited by	Year
Bridging offline reinforcement learning and imitation learning: A tale of pessimism. P Rashidinejad, B Zhu, C Ma, J Jiao, S Russell. Advances in Neural Information Processing Systems 34, 11702-11716, 2021	279	2021
Deconstructing Generative Adversarial Networks. B Zhu, J Jiao, D Tse. arXiv preprint arXiv:1901.09465, 2019	138*	2019
Principled reinforcement learning with human feedback from pairwise or k-wise comparisons. B Zhu, M Jordan, J Jiao. International Conference on Machine Learning, 43037-43067, 2023	111	2023
Joint transceiver optimization for wireless communication PHY using neural network. B Zhu, J Wang, L He, J Song. IEEE Journal on Selected Areas in Communications 37 (6), 1364-1373, 2019	103	2019
Jump-start reinforcement learning. I Uchendu, T Xiao, Y Lu, B Zhu, M Yan, J Simon, M Bennice, C Fu, C Ma, ... International Conference on Machine Learning, 34556-34583, 2023	79	2023
Chatbot Arena: An open platform for evaluating LLMs by human preference. WL Chiang, L Zheng, Y Sheng, AN Angelopoulos, T Li, D Li, H Zhang, ... arXiv preprint arXiv:2403.04132, 2024	49	2024
Generalized resilience and robust statistics. B Zhu, J Jiao, J Steinhardt. The Annals of Statistics 50 (4), 2256-2283, 2022	46	2022
Robust estimation via generalized quasi-gradients. B Zhu, J Jiao, J Steinhardt. Information and Inference: A Journal of the IMA 11 (2), 581-636, 2022	43	2022
Starling-7B: Improving LLM Helpfulness & Harmlessness with RLAIF. B Zhu, E Frick, T Wu, H Zhu, J Jiao. https://starling.cs.berkeley.edu/, 2023	35	2023
The sample complexity of online contract design. B Zhu, S Bates, Z Yang, Y Wang, J Jiao, MI Jordan. arXiv preprint arXiv:2211.05732, 2022	35	2022
S-LoRA: Serving thousands of concurrent LoRA adapters. Y Sheng, S Cao, D Li, C Hooper, N Lee, S Yang, C Chou, B Zhu, L Zheng, ... arXiv preprint arXiv:2311.03285, 2023	29	2023
Byzantine-robust federated learning with optimal statistical rates. B Zhu, L Wang, Q Pang, S Wang, J Jiao, D Song, MI Jordan. International Conference on Artificial Intelligence and Statistics, 3151-3178, 2023	28*	2023
Sparse tensor decomposition for haplotype assembly of diploids and polyploids. A Hashemi, B Zhu, H Vikalo. BMC Genomics 19, 1-15, 2018	26	2018
Fine-tuning language models with advantage-induced policy alignment. B Zhu, H Sharma, FV Frujeri, S Dong, C Zhu, MI Jordan, J Jiao. arXiv preprint arXiv:2306.02231, 2023	24	2023
When does the Tukey median work? B Zhu, J Jiao, J Steinhardt. 2020 IEEE International Symposium on Information Theory (ISIT), 1201-1206, 2020	18	2020
Pairwise proximal policy optimization: Harnessing relative feedback for LLM alignment. T Wu, B Zhu, R Zhang, Z Wen, K Ramchandran, J Jiao. arXiv preprint arXiv:2310.00212, 2023	17	2023
Online learning in Stackelberg games with an omniscient follower. G Zhao, B Zhu, J Jiao, M Jordan. International Conference on Machine Learning, 42304-42316, 2023	15	2023
Minimax off-policy evaluation for multi-armed bandits. C Ma, B Zhu, J Jiao, MJ Wainwright. IEEE Transactions on Information Theory 68 (8), 5314-5339, 2022	13	2022
Noisy Sorting Capacity. Z Wang, N Ghaddar, B Zhu, L Wang. arXiv preprint arXiv:2202.01446, 2023	10	2023
Linear representation meta-reinforcement learning for instant adaptation. M Peng, B Zhu, J Jiao. arXiv preprint arXiv:2101.04750, 2021	10	2021
