RL-Glue: Language-independent software for reinforcement-learning experiments B Tanner, A White The Journal of Machine Learning Research 10, 2133-2136, 2009 | 169 | 2009 |
Temporal-difference networks RS Sutton, B Tanner Advances in neural information processing systems 17, 2004 | 156 | 2004 |
Protecting against evaluation overfitting in empirical reinforcement learning S Whiteson, B Tanner, ME Taylor, P Stone 2011 IEEE symposium on adaptive dynamic programming and reinforcement …, 2011 | 151 | 2011 |
Using Predictive Representations to Improve Generalization in Reinforcement Learning. EJ Rafols, MB Ring, RS Sutton, B Tanner IJCAI, 835-840, 2005 | 67 | 2005 |
Hierarchical heuristic search revisited RC Holte, J Grajkowski, B Tanner International Symposium on Abstraction, Reformulation, and Approximation …, 2005 | 66 | 2005 |
Report on the 2008 reinforcement learning competition S Whiteson, B Tanner, A White AI Magazine 31 (2), 81-81, 2010 | 58 | 2010 |
Td (λ) networks: temporal-difference networks with eligibility traces B Tanner, RS Sutton Proceedings of the 22nd international conference on Machine learning, 888-895, 2005 | 39 | 2005 |
Temporal-Difference Networks with History. B Tanner, RS Sutton IJCAI, 865-870, 2005 | 32 | 2005 |
Reward-respecting subtasks for model-based reinforcement learning RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Artificial Intelligence 324, 104001, 2023 | 23 | 2023 |
Dynamic coalition formation in robotic soccer J Anderson, B Tanner, J Baltes Proceedings of the AAAI-04 Workshop on Forming and Maintaining Coalitions …, 2004 | 19 | 2004 |
Grounding Abstractions in Predictive State Representations. B Tanner, V Bulitko, A Koop, C Paduraru IJCAI, 1077-1082, 2007 | 14 | 2007 |
Reinforcement learning from teammates of varying skill in robotic soccer J Anderson, B Tanner, J Baltes FIRA Robot World Congress, 2004 | 5 | 2004 |
Forming and Maintaining Coalitions & Teams in Adaptive Multiagent Systems LK Soh, JE Anderson AAAI Workshop, San Jose CA, 2004 | 4 | 2004 |
Peer reinforcement in homogeneous and heterogeneous multi-agent learning J Anderson, B Tanner, R Wegner Proceedings of the IASTED International Conference on Artificial …, 2002 | 4 | 2002 |
Temporal-difference networks RS Sutton, B Tanner arXiv preprint arXiv:1504.05539, 2015 | 3 | 2015 |
Evaluating Agents using Social Choice Theory M Lanctot, K Larson, Y Bachrach, L Marris, Z Li, A Bhoopchand, ... arXiv preprint arXiv:2312.03121, 2023 | 2 | 2023 |
Exploiting opportunities through dynamic coalitions in robotic soccer J Anderson, R Wegner, B Tanner Proceedings of the AAAI International Workshop on Coalition Formation in …, 2002 | 2 | 2002 |
Reward-Respecting Subtasks for Model-Based Reinforcement Learning (Abstract Reprint) RS Sutton, MC Machado, GZ Holland, D Szepesvari, F Timbers, B Tanner, ... Proceedings of the AAAI Conference on Artificial Intelligence 38 (20), 22713 …, 2024 | | 2024 |
Numerical Optimization: Project Report New Objectives for Predictive Representations B Tanner | | 2005 |
Name of Author: Brian Timothy Tanner Title of Thesis: Temporal-Difference Networks Degree: Master of Science Year this Degree Granted: 2005 BT Tanner | | 2005 |