Zheng Wen 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	5517	4701
h 指数	31	30
i10 指数	60	55

1000

500

250

750

2014201520162017201820192020202120222023202428 67 146 179 329 505 742 850 992 917 694

开放获取的出版物数量

查看全部

8 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Branislav KvetonAmazon在 amazon.com 的电子邮件经过验证
Benjamin Van RoyStanford University在 stanford.edu 的电子邮件经过验证
Ian OsbandOpenAI在 openai.com 的电子邮件经过验证
Csaba SzepesvariDeepMind & University of Alberta在 cs.ualberta.ca 的电子邮件经过验证
Azin AshkanGoogle在 uwaterloo.ca 的电子邮件经过验证
Xiuyuan LuGoogle DeepMind在 google.com 的电子邮件经过验证
Yasin Abbasi YadkoriGoogle DeepMind在 google.com 的电子邮件经过验证
Vikranth DwaracherlaDeepMind在 google.com 的电子邮件经过验证
Morteza IbrahimiStanford University在 stanford.edu 的电子邮件经过验证
Mohammad GhavamzadehAmazon在 amazon.com 的电子邮件经过验证
Sharan VaswaniSimon Fraser University在 sfu.ca 的电子邮件经过验证
Daniel RussoColumbia University在 gsb.columbia.edu 的电子邮件经过验证
Botao HaoOpenAI在 openai.com 的电子邮件经过验证
Seyed Mohammad AsghariResearch Engineer, DeepMind在 google.com 的电子邮件经过验证
Michal ValkoLlama @ Meta Paris & Inria & MVA - Ex: Gemini and BYOL @ Google DeepMind在 meta.com 的电子邮件经过验证
Brian ErikssonAdobe在 adobe.com 的电子邮件经过验证
S MuthukrishnanRutgers Univ在 cs.rutgers.edu 的电子邮件经过验证
Sumeet KatariyaAmazon在 wisc.edu 的电子邮件经过验证
Shlomo BerkovskyMacquarie University在 mq.edu.au 的电子邮件经过验证
Abbas KazerouniStanford University在 stanford.edu 的电子邮件经过验证

关注

Zheng Wen

Google DeepMind

在 google.com 的电子邮件经过验证 - 首页

Artificial Intelligence Reinforcement Learning Operations Research Large Language Models


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
A Tutorial on Thompson Sampling D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen arXiv, https://arxiv.org/pdf/1707.02038.pdf, 0	1147*
Generalization and exploration via randomized value functions I Osband, B Van Roy, Z Wen International Conference on Machine Learning, 2377-2386, 2016	343	2016
Deep exploration via randomized value functions I Osband, B Van Roy, DJ Russo, Z Wen Journal of Machine Learning Research 20 (124), 1-62, 2019	337	2019
Tight Regret Bounds for Stochastic Combinatorial Semi-Bandits B Kveton, Z Wen, A Ashkan, C Szepesvari International Conference on Artificial Intelligence and Statistics (AISTATS …, 2014	323	2014
Cascading bandits: Learning to rank in the cascade model B Kveton, C Szepesvári, Z Wen, A Ashkan ICML, 2015	318	2015
Optimal demand response using device based reinforcement learning Z Wen, D O'Neill, HR Maei IEEE Transactions on Smart Grid, 2014	314	2014
Online influence maximization under independent cascade model with semi-bandit feedback Z Wen, B Kveton, M Valko, S Vaswani Advances in neural information processing systems 30, 2017	148*	2017
Nearly optimal adaptive procedure with change detection for piecewise-stationary bandit Y Cao, Z Wen, B Kveton, Y Xie The 22nd International Conference on Artificial Intelligence and Statistics …, 2019	141*	2019
Matroid bandits: Fast combinatorial optimization with learning B Kveton, Z Wen, A Ashkan, H Eydgahi, B Eriksson UAI 2014, 2014	130	2014
Cascading bandits for large-scale recommendation problems S Zong, H Ni, K Sung, NR Ke, Z Wen, B Kveton arXiv preprint arXiv:1603.05359, 2016	129	2016
Combinatorial cascading bandits B Kveton, Z Wen, A Ashkan, C Szepesvari Advances in Neural Information Processing Systems 28, 2015	129	2015
Optimal Greedy Diversity for Recommendation A Ashkan, B Kveton, S Berkovsky, Z Wen	117	2015
Efficient learning in large-scale combinatorial semi-bandits Z Wen, B Kveton, A Ashkan http://jmlr.org/proceedings/papers/v37/wen15.html, 2014	109	2014
Online learning to rank in stochastic click models M Zoghi, T Tunys, M Ghavamzadeh, B Kveton, C Szepesvari, Z Wen International conference on machine learning, 4199-4208, 2017	107	2017
Epistemic neural networks I Osband, Z Wen, SM Asghari, V Dwaracherla, M Ibrahimi, X Lu, ... Advances in Neural Information Processing Systems 36, 2024	97	2024
Efficient Exploration and Value Function Generalization in Deterministic Systems Z Wen, B Van Roy Advances in Neural Information Processing Systems, 3021--3029, 2013	91	2013
DCM Bandits: Learning to Rank with Multiple Clicks S Katariya, B Kveton, C Szepesvári, Z Wen arXiv, 2016	90	2016
Model-independent online learning for influence maximization S Vaswani, B Kveton, Z Wen, M Ghavamzadeh, LVS Lakshmanan, ... International conference on machine learning, 3530-3539, 2017	82*	2017
Garbage in, reward out: Bootstrapping exploration in multi-armed bandits B Kveton, C Szepesvari, S Vaswani, Z Wen, T Lattimore, M Ghavamzadeh International Conference on Machine Learning, 3601-3610, 2019	78	2019
Stochastic rank-1 bandits S Katariya, B Kveton, C Szepesvari, C Vernade, Z Wen Artificial Intelligence and Statistics, 392-401, 2017	74	2017

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用