Andrew Barto Academic Profile - Academic Resource Search

Cited by

	All	Since 2019
Citations	121966	58754
h-index	91	58
i10-index	211	138

Citations per year

Year	Citations
1990	392
1991	478
1992	506
1993	498
1994	604
1995	588
1996	568
1997	758
1998	797
1999	871
2000	996
2001	1291
2002	1317
2003	1535
2004	1816
2005	2251
2006	2357
2007	2761
2008	2946
2009	2948
2010	3173
2011	3279
2012	3395
2013	3682
2014	3705
2015	3414
2016	3840
2017	4394
2018	5968
2019	7881
2020	9542
2021	11047
2022	11772
2023	12513
2024	5990

Public access

11 articles available
0 articles not available

Based on funding mandates

Co-authors

Richard S. Sutton - Keen, Amii, and University of Alberta - Verified email at richsutton.com
George Konidaris - Brown - Verified email at cs.brown.edu
Charles Anderson - Professor of Computer Science, Colorado State University; Founder, Pattern Exploration, LLC - Verified email at colostate.edu
Roderic Grupen - University of Massachusetts - Verified email at cs.umass.edu
Scott Kuindersma - Senior Director of Robotics Research at Boston Dynamics - Verified email at seas.harvard.edu
Andrew Fagg - University of Oklahoma - Verified email at cs.ou.edu
Scott Niekum - Associate Professor, University of Massachusetts Amherst - Verified email at cs.umass.edu
Özgür Şimşek - Professor of Artificial Intelligence, University of Bath - Verified email at bath.ac.uk
Steven Bradtke - Research Engineer, Vistronix, Inc. - Verified email at asrcfederal.com
Sridhar Mahadevan - Director, Data Science Lab, Adobe Research & Professor, University of Massachusetts, Amherst - Verified email at cs.umass.edu
Richard L. Lewis - Professor of Psychology, Linguistics and Cognitive Science, University of Michigan - Verified email at umich.edu
Theodore J. Perkins - Ottawa Hospital Research Institute - Verified email at ohri.ca
Gianluca Baldassarre - Senior Researcher, Institute of Cognitive Sciences and Technologies, Italian National Research - Verified email at istc.cnr.it
Amy McGovern - University of Oklahoma - Verified email at ou.edu
Balaraman Ravindran - Professor of Computer Science, Indian Institute of Technology Madras - Verified email at cse.iitm.ac.in
Sarah Osentoski - Vinci4d - Verified email at vinci4d.ai
Marco Mirolli - Researcher, Istituto di Scienze e Tecnologie della Cognizione, CNR, Italy - Verified email at istc.cnr.it
Nuttapong Chentanez - NVIDIA, Chulalongkorn University - Verified email at eecs.berkeley.edu
Alicia Wolfe - Postdoc, Wesleyan University - Verified email at wesleyan.edu
Matthew Botvinick - Google DeepMind, Yale Law School, University College London - Verified email at google.com

Andrew Barto

University of Massachusetts Amherst

Verified email at cs.umass.edu

Reinforcement learning


Title	Authors	Venue	Cited by	Year
Reinforcement learning: An introduction	RS Sutton, AG Barto	MIT press, 2018	72262	2018
Introduction to reinforcement learning	RS Sutton, AG Barto	MIT press 135, 223-260, 1998	5828	1998
Neuronlike adaptive elements that can solve difficult learning control problems	AG Barto, RS Sutton, CW Anderson	IEEE transactions on systems, man, and cybernetics, 834-846, 1983	5051	1983
Toward a modern theory of adaptive networks: expectation and prediction.	RS Sutton, AG Barto	Psychological review 88 (2), 135, 1981	1821	1981
Recent advances in hierarchical reinforcement learning	AG Barto, S Mahadevan	Discrete event dynamic systems 13, 341-379, 2003	1740	2003
Learning to act using real-time dynamic programming	AG Barto, SJ Bradtke, SP Singh	Artificial intelligence 72 (1-2), 81-138, 1995	1653	1995
Introduction to reinforcement learning. Vol. 135	RS Sutton, AG Barto	MIT press Cambridge 5, 21-22, 1998	1121	1998
Intrinsically motivated reinforcement learning	N Chentanez, A Barto, S Singh	Advances in neural information processing systems 17, 2004	1027	2004
Linear least-squares algorithms for temporal difference learning	SJ Bradtke, AG Barto	Machine learning 22 (1), 33-57, 1996	1002	1996
Handbook of learning and approximate dynamic programming	J Si, AG Barto, WB Powell, D Wunsch	John Wiley & Sons, 2004	974	2004
Improving elevator performance using reinforcement learning	R Crites, A Barto	Advances in neural information processing systems 8, 1995	899	1995
A model of how the basal ganglia generate and use neural signals that predict reinforcement	JC Houk, JL Adams, AG Barto		882	1994
Reinforcement learning is direct adaptive optimal control	RS Sutton, AG Barto, RJ Williams	IEEE control systems magazine 12 (2), 19-22, 1992	811	1992
Task decomposition through competition in a modular connectionist architecture: The what and where vision tasks	RA Jacobs, MI Jordan, AG Barto	Cognitive science 15 (2), 219-250, 1991	800	1991
Time-derivative models of Pavlovian reinforcement.	RS Sutton, AG Barto	The MIT Press, 1990	794	1990
Reinforcement Learning: An Introduction. By Richard’s Sutton	AG Barto	SIAM Rev 6 (2), 423, 2021	749	2021
Adaptive critics and the basal ganglia	AG Barto		727	1994
Reinforcement learning: an introduction MIT Press	RS Sutton, AG Barto	Cambridge, MA 22447, 10, 1998	661	1998
Learning and sequential decision making	AG Barto, RS Sutton, C Watkins	University of Massachusetts, 1989	661	1989
Automatic discovery of subgoals in reinforcement learning using diverse density	A McGovern, AG Barto		646	2001

Articles 1–20
