Brian Tanner 个人学术档案 - 学术资源搜索

引用次数

	总计	2019 年至今
引用	814	258
h 指数	11	7
i10 指数	11	7

2004200520062007200820092010201120122013201420152016201720182019202020212022202320247 31 16 33 29 26 39 57 60 51 44 36 36 27 52 37 41 51 56 38 35

关注

Brian Tanner

Research Engineer, DeepMind

在 google.com 的电子邮件经过验证

Reinforcement Learning


标题按引用次数排序按年份排序按标题排序	引用次数引用次数	年份
RL-Glue: Language-independent software for reinforcement-learning experiments B Tanner, A White The Journal of Machine Learning Research 10, 2133-2136, 2009	169	2009
Temporal-difference networks RS Sutton, B Tanner Advances in neural information processing systems 17, 2004	156	2004
Protecting against evaluation overfitting in empirical reinforcement learning S Whiteson, B Tanner, ME Taylor, P Stone 2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011	151	2011
Using Predictive Representations to Improve Generalization in Reinforcement Learning. EJ Rafols, MB Ring, RS Sutton, B Tanner IJCAI, 835-840, 2005	67	2005
Hierarchical heuristic search revisited RC Holte, J Grajkowski, B Tanner International Symposium on Abstraction, Reformulation, and Approximation …, 2005	66	2005
Report on the 2008 reinforcement learning competition S Whiteson, B Tanner, A White AI Magazine 31 (2), 81-81, 2010	58	2010
Td (λ) networks: temporal-difference networks with eligibility traces B Tanner, RS Sutton Proceedings of the 22nd international conference on Machine learning, 888-895, 2005	39	2005
Temporal-Difference Networks with History. B Tanner, RS Sutton IJCAI, 865-870, 2005	32	2005
Reward-respecting subtasks for model-based reinforcement learning RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Artificial Intelligence 324, 104001, 2023	23	2023
Dynamic coalition formation in robotic soccer J Anderson, B Tanner, J Baltes Proceedings of the AAAI-04 Workshop on Forming and Maintaining Coalitions …, 2004	19	2004
Grounding Abstractions in Predictive State Representations. B Tanner, V Bulitko, A Koop, C Paduraru IJCAI, 1077-1082, 2007	14	2007
Reinforcement learning from teammates of varying skill in robotic soccer J Anderson, B Tanner, J Baltes FIRA Robot World Congress, 2004	5	2004
Forming and Maintaining Coalitions & Teams in Adaptive Multiagent Systems LK Soh, JE Anderson AAAI Workshop, San Jose CA, 2004	4	2004
Peer reinforcement in homogeneous and heterogeneous multi-agent learning J Anderson, B Tanner, R Wegner Proceedings of the IASTED International Conference on Artificial …, 2002	4	2002
Temporal-difference networks RS Sutton, B Tanner arXiv preprint arXiv:1504.05539, 2015	3	2015
Evaluating Agents using Social Choice Theory M Lanctot, K Larson, Y Bachrach, L Marris, Z Li, A Bhoopchand, ... arXiv preprint arXiv:2312.03121, 2023	2	2023
Exploiting opportunities through dynamic coalitions in robotic soccer J Anderson, R Wegner, B Tanner Proceedings of the AAAI International Workshop on Coalition Formation in …, 2002	2	2002
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint) RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024		2024
Numerical Optimization: Project Report New Objectives for Predictive Representations B Tanner		2005
Name of Author: Brian Timothy Tanner Title of Thesis: Temporal-Difference Networks Degree: Master of Science Year this Degree Granted: 2005 BT Tanner		2005

系统目前无法执行此操作，请稍后再试。

文章 1–20

每年引用数

重复的引用

合并的引用

添加合著者合著作者

上传 PDF

关注此作者

引用次数

引用