Benjamin Van Roy 个人学术档案

引用次数

	总计	2019 年至今
引用	18941	9881
h 指数	59	43
i10 指数	128	90

2100

1050

525

1575

199719981999200020012002200320042005200620072008200920102011201220132014201520162017201820192020202120222023202448 51 69 71 109 169 159 209 308 348 427 447 555 570 561 615 539 633 608 637 749 1000 1275 1673 1823 1984 2008 1115

开放获取的出版物数量

查看全部

5 篇文章

0 篇文章

可查看的文章

无法查看的文章

根据资助方的强制性开放获取政策

合著作者

Ian OsbandOpenAI在 openai.com 的电子邮件经过验证
John TsitsiklisProfessor of Electrical Engineering, MIT在 mit.edu 的电子邮件经过验证
Zheng WenGoogle DeepMind在 google.com 的电子邮件经过验证
Daniel RussoColumbia University在 gsb.columbia.edu 的电子邮件经过验证
Gabriel Y WeintraubStanford GSB在 stanford.edu 的电子邮件经过验证
Ciamac MoallemiProfessor, Graduate School of Business, Columbia University在 gsb.columbia.edu 的电子邮件经过验证
Morteza IbrahimiStanford University在 stanford.edu 的电子邮件经过验证
Paat RusmevichientongProfessor, Marshall School of Business, University of Southern California在 marshall.usc.edu 的电子邮件经过验证
Vivek FariasMassachusetts Institute of Technology在 mit.edu 的电子邮件经过验证
Abbas KazerouniStanford University在 stanford.edu 的电子邮件经过验证
Anant SAHAIEECS, University of California, Berkeley在 eecs.berkeley.edu 的电子邮件经过验证
Alexander PritzelDeepmind在 google.com 的电子邮件经过验证
Charles BlundellResearch Scientist at DeepMind在 google.com 的电子邮件经过验证
Tsachy WeissmanProfessor of Electrical Engineering at Stanford University在 stanford.edu 的电子邮件经过验证
Yi-Hao KaoPhD Candidate, Electrical Engineering, Stanford University在 stanford.edu 的电子邮件经过验证
Hui ZhangCarnegie Mellon University, Conviva在 andrew.cmu.edu 的电子邮件经过验证
Per EngeProfessor, Stanford University在 stanford.edu 的电子邮件经过验证
Ramesh GovindanProfessor of Computer Science, University of Southern California在 usc.edu 的电子邮件经过验证
Ashish GoelProfessor of Management Science and Engineering, and by courtesy, Computer Science, Stanford University在 stanford.edu 的电子邮件经过验证
Paul CuffRenaissance Technologies在 rentec.com 的电子邮件经过验证

关注

Benjamin Van Roy

Stanford University

在 stanford.edu 的电子邮件经过验证 - 首页

reinforcement learning operations research information theory


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
Analysis of temporal-diffference learning with function approximation J Tsitsiklis, B Van Roy Advances in neural information processing systems 9, 1996	2179	1996
Deep exploration via bootstrapped DQN I Osband, C Blundell, A Pritzel, B Van Roy Advances in neural information processing systems 29, 2016	1445	2016
A tutorial on thompson sampling D Russo, B Van Roy, A Kazerouni, I Osband, Z Wen Foundations and Trends in Machine Learning 11 (1), pp. 1-96, 2018	1106	2018
The linear programming approach to approximate dynamic programming DP De Farias, B Van Roy Operations research 51 (6), 850-865, 2003	968	2003
Regression methods for pricing complex American-style options JN Tsitsiklis, B Van Roy IEEE Transactions on Neural Networks 12 (4), 694-703, 2001	862	2001
Learning to optimize via posterior sampling D Russo, B Van Roy Mathematics of Operations Research 39 (4), 1221-1243, 2014	734	2014
Feature-based methods for large scale dynamic programming JN Tsitsiklis, B Van Roy Machine Learning 22 (1), 59-94, 1996	713	1996
Markov perfect industry dynamics with many firms G Weintraub, CL Benkard, B Van Roy Econometrica 76 (6), 1375-1411, 2008	567	2008
On constraint sampling in the linear programming approach to approximate dynamic programming DP De Farias, B Van Roy Mathematics of operations research 29 (3), 462-478, 2004	490	2004
Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives JN Tsitsiklis, B Van Roy IEEE Transactions on Automatic Control 44 (10), 1840-1851, 1999	477	1999
An information-theoretic analysis of thompson sampling D Russo, B Van Roy Journal of Machine Learning Research 17 (68), 1-30, 2016	417	2016
Generalization and exploration via randomized value functions I Osband, B Van Roy, Z Wen International Conference on Machine Learning, 2377-2386, 2016	327	2016
Deep Exploration via Randomized Value Functions. I Osband, B Van Roy, DJ Russo, Z Wen The Journal of Machine Learning Research 20 (124), 1-62, 2019	326	2019
Consensus propagation CC Moallemi, B Van Roy IEEE Transactions on Information Theory 52 (11), 4753-4766, 2006	302	2006
Solving data mining problems through pattern recognition RL Kennedy, Y Lee, B Van Roy, CD Reed, RP Lippman Upper Saddle River, NJ: Prentice Hall PTR, 2011	269*	2011
Dynamic pricing with a prior on market response VF Farias, B Van Roy Operations Research 58 (1), 16-29, 2010	269	2010
Why is posterior sampling better than optimism for reinforcement learning? I Osband, B Van Roy International conference on machine learning, 2701-2710, 2017	268	2017
Eluder dimension and the sample complexity of optimistic exploration D Russo, B Van Roy Advances in Neural Information Processing Systems 26, 2013	258	2013
A neuro-dynamic programming approach to retailer inventory management B Van Roy, DP Bertsekas, Y Lee, JN Tsitsiklis Proceedings of the 36th IEEE Conference on Decision and Control 4, 4052-4057, 1997	240	1997
Learning to optimize via information-directed sampling D Russo, B Van Roy Advances in Neural Information Processing Systems 27, 2014	238	2014

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

合著作者

引用