Chapters from Simulation-Based Optimization A Gosavi Simulation-Based Optimization, 2015 | 868* | 2015 |
Reinforcement learning: A tutorial survey and recent advances A Gosavi INFORMS Journal on Computing 21 (2), 178-192, 2009 | 416 | 2009 |
Solving semi-Markov decision problems using average reward reinforcement learning TK Das, A Gosavi, S Mahadevan, N Marchalleck Management Science 45 (4), 560-574, 1999 | 306 | 1999 |
Self-improving factory simulation using continuous-time average-reward reinforcement learning S Mahadevan, N Marchalleck, TK Das, A Gosavi MACHINE LEARNING-INTERNATIONAL WORKSHOP THEN CONFERENCE-, 202-210, 1997 | 195 | 1997 |
Joint optimization of product price, display orientation and shelf-space allocation in retail category management CC Murray, D Talukdar, A Gosavi Journal of retailing 86 (2), 125-136, 2010 | 183 | 2010 |
A reinforcement learning approach to a single leg airline revenue management problem with multiple fare classes and overbooking A Gosavii, N Bandla, TK Das IIE transactions 34 (9), 729-742, 2002 | 178 | 2002 |
Reinforcement learning for long-run average cost A Gosavi European journal of operational research 155 (3), 654-674, 2004 | 168 | 2004 |
A risk-sensitive approach to total productive maintenance A Gosavi Automatica 42 (8), 1321-1330, 2006 | 158 | 2006 |
Global supply chain management: a reinforcement learning approach P Pontrandolfo, A Gosavi, OG Okogbaa, TK Das International Journal of Production Research 40 (6), 1299-1317, 2002 | 133 | 2002 |
A reinforcement learning algorithm based on policy iteration for average reward: Empirical results with yield management and convergence analysis A Gosavi Machine Learning 55, 5-29, 2004 | 120 | 2004 |
A reinforcement learning approach for inventory replenishment in vendor-managed inventory systems with consignment inventory Z Sui, A Gosavi, L Lin Engineering Management Journal 22 (4), 44-53, 2010 | 89 | 2010 |
Simulation optimization for revenue management of airlines with cancellations and overbooking A Gosavi, E Ozkaya, AF Kahraman OR spectrum 29, 21-38, 2007 | 88 | 2007 |
Economic design of dual-sampling-interval policies for X charts with and without run rules TK Das, V JAIN, A GOSAVI IIE transactions 29 (6), 497-506, 1997 | 63 | 1997 |
The multi-product price-setting newsvendor with resource capacity constraints CC Murray, A Gosavi, D Talukdar International Journal of Production Economics 138 (1), 148-158, 2012 | 51 | 2012 |
Flexible and intelligent learning architectures for SoS (FILA-SoS): Architectural evolution in systems-of-systems S Agarwal, LE Pape, CH Dagli, NK Ergin, D Enke, A Gosavi, R Qin, ... Procedia Computer Science 44, 76-85, 2015 | 46 | 2015 |
Form errors in precision metrology: a survey of measurement techniques A Gosavi, E Cudney Quality Engineering 24 (3), 369-380, 2012 | 43 | 2012 |
Boundedness of iterates in Q-learning A Gosavi Systems & control letters 55 (4), 347-349, 2006 | 43 | 2006 |
Automated tuning of optimization software parameters M Baz, B Hunsaker, P Brooks, A Gosavi Technical Report TR2007-7, University of Pittsburgh, Department of …, 2007 | 41 | 2007 |
A tutorial for reinforcement learning A Gosavi The State University of New York at Buffalo, 2017 | 34 | 2017 |
Analyzing Responses from Likert Surveys and Risk-adjusted Ranking: A Data Analytics Perspective. A Gosavi Complex Adaptive Systems 61, 24-31, 2015 | 34 | 2015 |