Ziniu Li 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	233	233
h 指数	7	7
i10 指数	7	7

100

2019202020212022202320241 3 14 40 76 99

开放获取的出版物数量

查看全部

4 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Tian XuNanjing University在 lamda.nju.edu.cn 的电子邮件经过验证
Yang YuProfessor, Nanjing University在 nju.edu.cn 的电子邮件经过验证
Zhi-Quan LuoProfessor, The Chinese University of Hong Kong, Shenzhen, China在 cuhk.edu.cn 的电子邮件经过验证
Yushun ZhangThe Chinese University of Hong Kong, Shenzhen, China在 link.cuhk.edu.cn 的电子邮件经过验证
Yingru LiThe Chinese University of Hong Kong, Shenzhen, China在 link.cuhk.edu.cn 的电子邮件经过验证
Tong ZhangUIUC在 tongzhang-ml.org 的电子邮件经过验证
Zeyu QinHong Kong University of Science and Technology在 connect.ust.hk 的电子邮件经过验证
Jiancong XiaoUniversity of Pennsylvania在 upenn.edu 的电子邮件经过验证
Weijie SuAssociate Professor, University of Pennsylvania在 wharton.upenn.edu 的电子邮件经过验证
Xiong-Hui Chen (陈雄辉)Nanjing University在 lamda.nju.edu.cn 的电子邮件经过验证
Ruoyu SunChinese University of Hong Kong (Shenzhen), Shenzhen Institue of Big Data

关注

Ziniu Li

其他姓名Zi-Niu Li

The Chinese University of Hong Kong, Shenzhen

在 link.cuhk.edu.cn 的电子邮件经过验证 - 首页

Artificial Intelligence Machine Learning Reinforcement Learning Large Language Models


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Error bounds of imitating policies and environments T Xu, Z Li, Y Yu Advances in Neural Information Processing Systems 33, 15737-15749, 2020	97	2020
Error bounds of imitating policies and environments for reinforcement learning T Xu, Z Li, Y Yu IEEE Transactions on Pattern Analysis and Machine Intelligence 44 (10), 6968 …, 2021	33	2021
Self-Guided Evolution Strategies with Historical Estimated Gradients FY Liu, ZN Li, C Qian IJCAI, 1474-1480, 2020	19	2020
HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning Z Li, Y Li, Y Zhang, T Zhang, ZQ Luo International Conference on Learning Representations, 2022	16	2022
ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models Z Li, T Xu, Y Zhang, Z Lin, Y Yu, R Sun, ZQ Luo Forty-first International Conference on Machine Learning, 2023	14*	2023
Rethinking ValueDice - Does It Really Improve Performance? Z Li, T Xu, Y Yu, ZQ Luo ICLR Blog, 2022	13	2022
Understanding adversarial imitation learning in small sample regime: A stage-coupled analysis T Xu, Z Li, Y Yu, ZQ Luo arXiv preprint arXiv:2208.01899, 2022	11*	2022
When is RL better than DPO in RLHF? A Representation and Optimization Perspective Z Li, T Xu, Y Yu ICLR Tiny Paper, 2024	7*	2024
Provably Efficient Adversarial Imitation Learning with Unknown Transitions T Xu, Z Li, Y Yu, ZQ Luo UAI, 2367-2378, 2023	7	2023
Imitation learning from imperfection: Theoretical justifications and algorithms Z Li, T Xu, Z Qin, Y Yu, ZQ Luo Advances in Neural Information Processing Systems 36, 2024	6*	2024
Why transformers need adam: A hessian perspective Y Zhang, C Chen, T Ding, Z Li, R Sun, ZQ Luo arXiv preprint arXiv:2402.16788, 2024	5	2024
On the Algorithmic Bias of Aligning Large Language Models with RLHF: Preference Collapse and Matching Regularization J Xiao, Z Li, X Xie, E Getzen, C Fang, Q Long, WJ Su arXiv preprint arXiv:2405.16455, 2024	3	2024
A Note on Target Q-learning For Solving Finite MDPs with A Generative Oracle Z Li, T Xu, Y Yu arXiv preprint arXiv:2203.11489, 2022	1	2022
Efficient Exploration by Novelty-Pursuit Z Li, XH Chen Distributed Artificial Intelligence: Second International Conference, DAI …, 2020	1	2020
Adam-mini: Use Fewer Learning Rates To Gain More Y Zhang, C Chen, Z Li, T Ding, C Wu, Y Ye, ZQ Luo, R Sun arXiv preprint arXiv:2406.16793, 2024		2024
BWArea Model: Learning World Model, Inverse Dynamics, and Policy for Controllable Language Generation C Jia, P Wang, Z Li, YC Li, Z Zhang, N Tang, Y Yu arXiv preprint arXiv:2405.17039, 2024		2024

系统目前无法执行此操作，请稍后再试。

文章 1–16

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用