Reinforcement learning: An introduction RS Sutton, AG Barto MIT press, 2018 | 72262 | 2018 |
Introduction to reinforcement learning RS Sutton, AG Barto MIT press 135, 223-260, 1998 | 5828 | 1998 |
Neuronlike adaptive elements that can solve difficult learning control problems AG Barto, RS Sutton, CW Anderson IEEE transactions on systems, man, and cybernetics, 834-846, 1983 | 5051 | 1983 |
Toward a modern theory of adaptive networks: expectation and prediction. RS Sutton, AG Barto Psychological review 88 (2), 135, 1981 | 1821 | 1981 |
Recent advances in hierarchical reinforcement learning AG Barto, S Mahadevan Discrete event dynamic systems 13, 341-379, 2003 | 1740 | 2003 |
Learning to act using real-time dynamic programming AG Barto, SJ Bradtke, SP Singh Artificial intelligence 72 (1-2), 81-138, 1995 | 1653 | 1995 |
Introduction to reinforcement learning. Vol. 135 RS Sutton, AG Barto MIT press Cambridge 5, 21-22, 1998 | 1121 | 1998 |
Intrinsically motivated reinforcement learning N Chentanez, A Barto, S Singh Advances in neural information processing systems 17, 2004 | 1027 | 2004 |
Linear least-squares algorithms for temporal difference learning SJ Bradtke, AG Barto Machine learning 22 (1), 33-57, 1996 | 1002 | 1996 |
Handbook of learning and approximate dynamic programming J Si, AG Barto, WB Powell, D Wunsch John Wiley & Sons, 2004 | 974 | 2004 |
Improving elevator performance using reinforcement learning R Crites, A Barto Advances in neural information processing systems 8, 1995 | 899 | 1995 |
A model of how the basal ganglia generate and use neural signals that predict reinforcement JC Houk, JL Adams, AG Barto | 882 | 1994 |
Reinforcement learning is direct adaptive optimal control RS Sutton, AG Barto, RJ Williams IEEE control systems magazine 12 (2), 19-22, 1992 | 811 | 1992 |
Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks RA Jacobs, MI Jordan, AG Barto Cognitive science 15 (2), 219-250, 1991 | 800 | 1991 |
Time-derivative models of Pavlovian reinforcement. RS Sutton, AG Barto The MIT Press, 1990 | 794 | 1990 |
Reinforcement Learning: An Introduction. By Richard’s Sutton AG Barto SIAM Rev 6 (2), 423, 2021 | 749 | 2021 |
Adaptive critics and the basal ganglia AG Barto | 727 | 1994 |
Reinforcement learning: an introduction MIT Press RS Sutton, AG Barto Cambridge, MA 22447, 10, 1998 | 661 | 1998 |
Learning and sequential decision making AG Barto, RS Sutton, C Watkins University of Massachusetts, 1989 | 661 | 1989 |
Automatic discovery of subgoals in reinforcement learning using diverse density A McGovern, AG Barto | 646 | 2001 |