Hanlin Zhu 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	237	236
h 指数	6	6
i10 指数	5	5

120

20182019202020212022202320241 1 24 34 22 53 101

开放获取的出版物数量

查看全部

1 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Jiantao JiaoAssistant Professor of EECS and Statistics, University of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Banghua ZhuUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Stuart RussellProfessor of Computer Science, University of California, Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Paria RashidinejadPostdoctoral Scholar, University of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Ryuichi TakanobumiHoYo在 mihoyo.com 的电子邮件经过验证
Cyrus RashtchianGoogle Research在 eng.ucsd.edu 的电子邮件经过验证
David WoodruffProfessor of Computer Science, Carnegie Mellon University在 cs.cmu.edu 的电子邮件经过验证
Tianhao WuUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Yuandong TianResearch Scientist, Meta AI (FAIR)在 fb.com 的电子邮件经过验证
Baihe HuangUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Kunhe YangUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Danqing WangCarnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Kevin YangUC Berkeley在 berkeley.edu 的电子邮件经过验证
Xiaomeng YangGoogle DeepMind在 google.com 的电子邮件经过验证
Michael I. JordanProfessor of Electrical Engineering and Computer Sciences and Professor of Statistics, UC Berkeley在 cs.berkeley.edu 的电子邮件经过验证
Ruosong WangPhD Student, Carnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Jason D. LeeAssociate Professor of Electrical Engineering and Computer Science, Princeton University在 princeton.edu 的电子邮件经过验证
Amy ZhangAssistant Professor of Electrical and Computer Engineering at University of Texas at Austin在 austin.utexas.edu 的电子邮件经过验证
Benjamin PlautUniversity of California, Berkeley在 berkeley.edu 的电子邮件经过验证
Minlie HuangTsinghua University在 tsinghua.edu.cn 的电子邮件经过验证

关注

Hanlin Zhu

Ph.D. student, University of California, Berkeley

在 berkeley.edu 的电子邮件经过验证 - 首页

machine learning theoretical computer science game theory


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Guided dialog policy learning: Reward estimation for multi-domain task-oriented dialog R Takanobu, H Zhu, M Huang Conference on Empirical Methods in Natural Language Processing, 100-110, 2019	87	2019
Starling-7b: Improving llm helpfulness & harmlessness with rlaif B Zhu, E Frick, T Wu, H Zhu, J Jiao November, 2023	45	2023
Optimal conservative offline rl with general function approximation via augmented lagrangian P Rashidinejad, H Zhu, K Yang, S Russell, J Jiao arXiv preprint arXiv:2211.00716, 2022	32	2022
Vector-matrix-vector queries for solving linear algebra, statistics, and graph problems C Rashtchian, DP Woodruff, H Zhu Approximation, Randomization, and Combinatorial Optimization. Algorithms and …, 2020	31	2020
Importance weighted actor-critic for optimal conservative offline reinforcement learning H Zhu, P Rashidinejad, J Jiao Advances in Neural Information Processing Systems 36, 2024	10	2024
Learning personalized story evaluation D Wang, K Yang, H Zhu, X Yang, A Cohen, L Li, Y Tian arXiv preprint arXiv:2310.03304, 2023	7	2023
Towards optimal statistical watermarking B Huang, B Zhu, H Zhu, JD Lee, J Jiao, MI Jordan arXiv preprint arXiv:2312.07930, 2023	5	2023
Provably efficient reinforcement learning via surprise bound H Zhu, R Wang, J Lee International Conference on Artificial Intelligence and Statistics, 4006-4032, 2023	5	2023
Average-case communication complexity of statistical problems C Rashtchian, D Woodruff, P Ye, H Zhu Conference on Learning Theory, 3859-3886, 2021	5	2021
Provably efficient offline goal-conditioned reinforcement learning with general function approximation and single-policy concentrability H Zhu, A Zhang Advances in Neural Information Processing Systems 36, 2024	4	2024
On Representation Complexity of Model-based and Model-free Reinforcement Learning H Zhu, B Huang, S Russell arXiv preprint arXiv:2310.01706, 2023	3	2023
End-to-end Story Plot Generator H Zhu, A Cohen, D Wang, K Yang, X Yang, J Jiao, Y Tian arXiv preprint arXiv:2310.08796, 2023	2	2023
Efficient Prompt Caching via Embedding Similarity H Zhu, B Zhu, J Jiao arXiv preprint arXiv:2402.01173, 2024	1	2024
Towards a Theoretical Understanding of the'Reversal Curse'via Training Dynamics H Zhu, B Huang, S Zhang, M Jordan, J Jiao, Y Tian, S Russell arXiv preprint arXiv:2405.04669, 2024		2024
Avoiding Catastrophe in Continuous Spaces by Asking for Help B Plaut, H Zhu, S Russell arXiv preprint arXiv:2402.08062, 2024		2024

系统目前无法执行此操作，请稍后再试。

文章 1–15

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用