Huang Jiawei 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	485	485
h 指数	7	7
i10 指数	7	7

160

120

2019202020212022202320247 30 76 117 148 106

开放获取的出版物数量

查看全部

5 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Nan JiangAssistant Professor of Computer Science, UIUC在 illinois.edu 的电子邮件经过验证
Masatoshi UeharaGenentech在 gene.com 的电子邮件经过验证
Ningning MaNIO在 ust.hk 的电子邮件经过验证
Xiangyu ZhangPrincipal Researcher, MEGVII Technology在 megvii.com 的电子邮件经过验证
Jian SunChief Scientist of Megvii, Managing Director of Megvii Research在 megvii.com 的电子邮件经过验证
Li ZhaoResearcher在 microsoft.com 的电子邮件经过验证
Tao QinSenior Principal Research Manager, Microsoft Research在 microsoft.com 的电子邮件经过验证
Tie-Yan LiuDistinguished Scientist, Microsoft Research AI4Science | IEEE Fellow | ACM Fellow | AAIA Fellow在 microsoft.com 的电子邮件经过验证
Niao HeETH Zürich在 inf.ethz.ch 的电子邮件经过验证
Chengchun ShiLondon School of Economics and Political Science在 lse.ac.uk 的电子邮件经过验证
Jinglin ChenUniversity of Illinois Urbana-Champaign在 illinois.edu 的电子邮件经过验证
Batuhan YardimETH Zurich在 ethz.ch 的电子邮件经过验证
Wei Chen （陈卫）Microsoft Research在 microsoft.com 的电子邮件经过验证
Andreas KrauseProfessor of Computer Science, ETH Zurich在 inf.ethz.ch 的电子邮件经过验证
Vinzenz ThomaDoctoral Fellow, ETH Zurich在 ethz.ch 的电子邮件经过验证
Zebang Shen 沈泽邦ETH Zürich在 inf.ethz.ch 的电子邮件经过验证
Heinrich H. NaxSNSF Assistant Professor在 ethz.ch 的电子邮件经过验证

关注

Huang Jiawei

ETH Zurich

在 inf.ethz.ch 的电子邮件经过验证 - 首页

Machine Learning Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Minimax weight and q-function learning for off-policy evaluation M Uehara, J Huang, N Jiang International Conference on Machine Learning, 9659-9668, 2019	183	2019
Weightnet: Revisiting the design space of weight networks N Ma, X Zhang, J Huang, J Sun European Conference on Computer Vision, 776-792, 2020	109	2020
Minimax value interval for off-policy evaluation and policy optimization N Jiang, J Huang Advances in Neural Information Processing Systems 33, 2747-2758, 2020	80	2020
A minimax learning approach to off-policy evaluation in confounded Partially Observable Markov Decision Processes C Shi, M Uehara, J Huang, N Jiang International Conference on Machine Learning, 2022	37	2022
From Importance Sampling to Doubly Robust Policy Gradient J Huang, N Jiang International Conference on Machine Learning, 4434-4443, 2019	30	2019
Towards Deployment-Efficient Reinforcement Learning: Lower Bound and Optimality J Huang, J Chen, L Zhao, T Qin, N Jiang, TY Liu International Conference on Learning Representations 2022, 2022	25	2022
On the convergence rate of off-policy policy optimization methods with density-ratio correction J Huang, N Jiang International Conference on Artificial Intelligence and Statistics, 2658-2705, 2022	10*	2022
On the Statistical Efficiency of Mean-Field Reinforcement Learning with General Function Approximation J Huang, B Yardim, N He International Conference on Artificial Intelligence and Statistics, 289-297, 2024	7	2024
Model-Based RL for Mean-Field Games is not Statistically Harder than Single-Agent RL J Huang, N He, A Krause arXiv preprint arXiv:2402.05724, 2024	2	2024
Tiered Reinforcement Learning: Pessimism in the Face of Uncertainty and Constant Regret J Huang, L Zhao, T Qin, W Chen, N Jiang, TY Liu Advances in Neural Information Processing Systems 35, 2022	2	2022
Learning to Steer Markovian Agents under Model Uncertainty J Huang, V Thoma, Z Shen, HH Nax, N He arXiv preprint arXiv:2407.10207, 2024		2024
Robust Knowledge Transfer in Tiered Reinforcement Learning J Huang, N He Advances in Neural Information Processing Systems 36, 2023		2023

系统目前无法执行此操作，请稍后再试。

文章 1–12

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用