Chen-Yu Wei 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	1879	1857
h 指数	23	23
i10 指数	31	31

500

250

125

375

201820192020202120222023202417 70 144 326 415 500 399

开放获取的出版物数量

查看全部

22 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Haipeng LuoAssociate Professor, University of Southern California在 usc.edu 的电子邮件经过验证
Chung-Wei LeeUniversity of Southern California在 usc.edu 的电子邮件经过验证
Julian ZimmertGoogle Research在 google.com 的电子邮件经过验证
Mengxiao ZhangPh.D. student, University of Southern California在 usc.edu 的电子邮件经过验证
John LangfordMicrosoft Research New York在 hunch.net 的电子邮件经过验证
Alekh AgarwalGoogle在 google.com 的电子邮件经过验证
Christoph DannResearch Scientist, Google在 google.com 的电子邮件经过验证
Chi-Jen LuResearch Fellow, Institute of Information Science, Academia Sinica在 iis.sinica.edu.tw 的电子邮件经过验证
Rahul JainProfessor of ECE, CS and ISE, University of Southern California在 usc.edu 的电子邮件经过验证
Mehdi Jafarnia JahromiDeepMind在 google.com 的电子邮件经过验证
Liyu ChenUniversity of Southern California在 usc.edu 的电子邮件经过验证
Kaiqing ZhangAssistant Professor, University of Maryland, College Park在 umd.edu 的电子邮件经过验证
Dongsheng DingUniversity of Pennsylvania在 seas.upenn.edu 的电子邮件经过验证
Weiqiang ZhengYale University在 yale.edu 的电子邮件经过验证
Yang CaiAssociate Professor of Computer Science and Economics, Yale University在 yale.edu 的电子邮件经过验证
Hiteshi SharmaMicrosoft在 microsoft.com 的电子邮件经过验证
Alberto BiettiFlatiron Institute, Simons Foundation在 nyu.edu 的电子邮件经过验证
Miroslav DudikMicrosoft Research在 microsoft.com 的电子邮件经过验证
Zhiwei Steven WuCarnegie Mellon University在 andrew.cmu.edu 的电子邮件经过验证
Mihailo R. JovanovicProfessor of Electrical and Computer Engineering, University of Southern California在 usc.edu 的电子邮件经过验证

关注

Chen-Yu Wei

Assistant Professor, University of Virginia

在 virginia.edu 的电子邮件经过验证 - 首页


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
More adaptive algorithms for adversarial bandits CY Wei, H Luo Conference On Learning Theory, 1263-1291, 2018	164	2018
Online reinforcement learning in stochastic games CY Wei, YT Hong, CJ Lu Advances in Neural Information Processing Systems 30, 2017	137	2017
A new algorithm for non-stationary contextual bandits: Efficient, optimal and parameter-free Y Chen, CW Lee, H Luo, CY Wei Conference on Learning Theory, 696-726, 2019	127	2019
Efficient contextual bandits in non-stationary worlds H Luo, CY Wei, A Agarwal, J Langford Conference On Learning Theory, 1739-1776, 2018	125	2018
Linear last-iterate convergence in constrained saddle-point optimization CY Wei, CW Lee, M Zhang, H Luo International Conference on Learning Representations, 2021	118*	2021
Model-free reinforcement learning in infinite-horizon average-reward markov decision processes CY Wei, MJ Jahromi, H Luo, H Sharma, R Jain International conference on machine learning, 10170-10180, 2020	97	2020
Last-iterate convergence of decentralized optimistic gradient descent/ascent in infinite-horizon competitive Markov games CY Wei, CW Lee, M Zhang, H Luo Conference on learning theory, 4259-4299, 2021	96	2021
Non-stationary reinforcement learning without prior knowledge: An optimal black-box approach CY Wei, H Luo Conference on learning theory, 4300-4354, 2021	95	2021
Beating stochastic and adversarial semi-bandits optimally and simultaneously J Zimmert, H Luo, CY Wei International Conference on Machine Learning, 7683-7692, 2019	86	2019
Tracking the best expert in non-stationary stochastic environments CY Wei, YT Hong, CJ Lu Advances in neural information processing systems 29, 2016	67	2016
Independent policy gradient for large-scale markov potential games: Sharper rates, function approximation, and game-agnostic convergence D Ding, CY Wei, K Zhang, M Jovanovic International Conference on Machine Learning, 5166-5220, 2022	66	2022
Learning infinite-horizon average-reward mdps with linear function approximation CY Wei, MJ Jahromi, H Luo, R Jain International Conference on Artificial Intelligence and Statistics, 3007-3015, 2021	56	2021
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and mdps CW Lee, H Luo, CY Wei, M Zhang Advances in neural information processing systems 33, 15522-15533, 2020	55	2020
Efficient online portfolio with logarithmic regret H Luo, CY Wei, K Zheng Advances in neural information processing systems 31, 2018	55	2018
Improved path-length regret bounds for bandits S Bubeck, Y Li, H Luo, CY Wei Conference On Learning Theory, 508-528, 2019	51	2019
A model selection approach for corruption robust reinforcement learning CY Wei, C Dann, J Zimmert International Conference on Algorithmic Learning Theory, 1043-1096, 2022	49	2022
Federated residual learning A Agarwal, J Langford, CY Wei arXiv preprint arXiv:2003.12880, 2020	47	2020
Achieving near instance-optimality and minimax-optimality in stochastic and adversarial linear bandits simultaneously CW Lee, H Luo, CY Wei, M Zhang, X Zhang International Conference on Machine Learning, 6142-6151, 2021	46	2021
Impossible tuning made possible: A new expert algorithm and its applications L Chen, H Luo, CY Wei Conference on Learning Theory, 1216-1259, 2021	45	2021
Policy optimization in adversarial mdps: Improved exploration via dilated bonuses H Luo, CY Wei, CW Lee Advances in Neural Information Processing Systems 34, 22931-22942, 2021	44	2021

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用