Shalabh Bhatnagar: Academic Profile

Citations

	Total	Since 2019
Citations	7258	3927
h-index	35	26
i10-index	94	54

Citations per year

Year	Citations
2003	27
2004	30
2005	58
2006	71
2007	68
2008	64
2009	75
2010	128
2011	231
2012	224
2013	252
2014	279
2015	277
2016	293
2017	303
2018	419
2019	514
2020	526
2021	726
2022	733
2023	967
2024	458

Open-access publications (based on funder open-access mandates)

Available	35 articles
Not available	9 articles

Shalabh Bhatnagar

Professor in the Department of Computer Science and Automation, Indian Institute of Science

Verified email at iisc.ac.in

Stochastic systems control simulation optimization


Title	Authors	Venue	Cited by	Year
Incremental natural actor-critic algorithms	S Bhatnagar, M Ghavamzadeh, M Lee, RS Sutton	Advances in neural information processing systems 20, 2007	1068	2007
Fast gradient-descent methods for temporal-difference learning with linear function approximation	RS Sutton, HR Maei, D Precup, S Bhatnagar, D Silver, C Szepesvári, ...	Proceedings of the 26th annual international conference on machine learning …, 2009	713	2009
Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation Methods	S Bhatnagar, HL Prasad, LA Prashanth	Stochastic Recursive Algorithms for Optimization: Simultaneous Perturbation …, 2013	415*	2013
Reinforcement learning with function approximation for traffic signal control	LA Prashanth, S Bhatnagar	IEEE Transactions on Intelligent Transportation Systems 12 (2), 412-421, 2010	381	2010
Convergent temporal-difference learning with arbitrary smooth function approximation	H Maei, C Szepesvari, S Bhatnagar, D Precup, D Silver, RS Sutton	Advances in neural information processing systems 22, 2009	340	2009
Toward off-policy learning control with function approximation	HR Maei, C Szepesvári, S Bhatnagar, RS Sutton	ICML 10, 719-726, 2010	333	2010
An online actor–critic algorithm with function approximation for constrained Markov decision processes	S Bhatnagar, K Lakshmanan	Journal of Optimization Theory and Applications 153, 688-708, 2012	284	2012
An actor–critic algorithm with function approximation for discounted cost constrained Markov decision processes	S Bhatnagar	Systems & Control Letters 59 (12), 760-766, 2010	226	2010
Memory-based deep reinforcement learning for obstacle avoidance in UAV with limited environment knowledge	A Singla, S Padakandla, S Bhatnagar	IEEE Transactions on Intelligent Transportation Systems 22 (1), 107-118, 2019	217	2019
Reinforcement learning algorithm for non-stationary environments	S Padakandla, KJ Prabuchandran, S Bhatnagar	Applied Intelligence 50 (11), 3590-3606, 2020	136	2020
Two-timescale simultaneous perturbation stochastic approximation using deterministic perturbation sequences	S Bhatnagar, MC Fu, SI Marcus, IJ Wang	ACM Transactions on Modeling and Computer Simulation (TOMACS) 13 (2), 180-209, 2003	116	2003
Multi-agent reinforcement learning for traffic signal control	KJ Prabuchandran, HK AN, S Bhatnagar	17th International IEEE Conference on Intelligent Transportation Systems …, 2014	114	2014
Reinforcement learning with average cost for adaptive control of traffic lights at intersections	LA Prashanth, S Bhatnagar	2011 14th International IEEE Conference on Intelligent Transportation …, 2011	89	2011
A time aggregation approach to Markov decision processes	XR Cao, Z Ren, S Bhatnagar, M Fu, S Marcus	Automatica 38 (6), 929-943, 2002	89	2002
Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization	S Bhatnagar	ACM Transactions on Modeling and Computer Simulation (TOMACS) 15 (1), 74-107, 2005	79	2005
Two-timescale algorithms for learning Nash equilibria in general-sum stochastic games	HL Prasad, LA Prashanth, S Bhatnagar	Proceedings of the 2015 International Conference on Autonomous Agents and …, 2015	70	2015
Two time-scale stochastic approximation with controlled Markov noise and off-policy temporal-difference learning	P Karmakar, S Bhatnagar	Mathematics of Operations Research 43 (1), 130-151, 2018	69	2018
Adaptive Newton-based multivariate smoothed functional algorithms for simulation optimization	S Bhatnagar	ACM Transactions on Modeling and Computer Simulation (TOMACS) 18 (1), 1-35, 2007	68	2007
Two-timescale algorithms for simulation optimization of hidden Markov models	S Bhatnagar, MC Fu, SI Marcus, S Bhatnagar	IIE Transactions 33 (3), 245-258, 2001	59	2001
A Simultaneous Deterministic Perturbation Actor-Critic Algorithm with an Application to Optimal Mortgage Refinancing	VLR Chinthalapati, S Bhatnagar	Proceedings of the 45th IEEE Conference on Decision and Control, 4151-4156, 2006	58	2006
