[PDF][PDF] Guiding a reinforcement learner with natural language advice: Initial results in RoboCup soccer

G Kuhlmann, P Stone, R Mooney… - The AAAI-2004 workshop …, 2004 - cs.wisc.edu
We describe our current efforts towards creating a reinforcement learner that learns both
from reinforcements provided by its environment and from human-generated advice. Our …

Situation based strategic positioning for coordinating a team of homogeneous agents

M Hannebauer, J Wendler, E Pagello, LP Reis… - Balancing Reactivity and …, 2001 - Springer
In this paper we are proposing an approach for coordinating a team of homogeneous agents
based on a flexible common Team Strategy as well as on the concepts of Situation Based …

Behavior transfer for value-function-based reinforcement learning

ME Taylor, P Stone - Proceedings of the fourth international joint …, 2005 - dl.acm.org
Temporal difference (TD) learning methods [22] have become popular reinforcement
learning techniques in recent years. TD methods have had some experimental successes …

Helios base: An open source package for the robocup soccer 2d simulation

H Akiyama, T Nakashima - RoboCup 2013: Robot World Cup XVII 17, 2014 - Springer
To promote the research of multiagent systems, several base codes have been released for
the RoboCup soccer 2D simulation community. As described herein, we present HELIOS …

[PDF][PDF] The incremental development of a synthetic multi-agent system: The uva trilearn 2001 robotic soccer simulation team

R de Boer, J Kok - 2002 - staff.science.uva.nl
This thesis describes the incremental development and main features of a synthetic multi-
agent system called UvA Trilearn 2001. UvA Trilearn 2001 is a robotic soccer simulation …

Cyrus2D Base: Source Code Base for RoboCup 2D Soccer Simulation League

N Zare, O Amini, A Sayareh, M Sarvmaili… - Robot World Cup, 2022 - Springer
Abstract Soccer Simulation 2D League is one of the major leagues of RoboCup
competitions. In a Soccer Simulation 2D (SS2D) game, two teams of 11 players and one …

[PDF][PDF] Value functions for RL-based behavior transfer: A comparative study

ME Taylor, P Stone, Y Liu - AAAI, 2005 - cdn.aaai.org
Temporal difference (TD) learning methods (Sutton & Barto 1998) have become popular
reinforcement learning techniques in recent years. TD methods, relying on function …

[PDF][PDF] Helios2013 team description paper

H Akiyama, T Nakashima, K Yamashita, S Mifune - RoboCup, 2013 - archive.robocup.info
This team description paper presents an overview of previous work and recent research
topics of Team HELIOS2023. We have been attempting to construct a method that facilitates …

Gliders2d: source code base for RoboCup 2D soccer simulation league

M Prokopenko, P Wang - RoboCup 2019: Robot World Cup XXIII 23, 2019 - Springer
We describe Gliders2d, a base code release for Gliders, a soccer simulation team which
won the RoboCup Soccer 2D Simulation League in 2016. We trace six evolutionary steps …

[PDF][PDF] Defining and using ideal teammate and opponent agent models

P Stone, P Riley, M Veloso - AAAI/IAAI, 2000 - cdn.aaai.org
A common challenge for agents in multiagent systems is trying to predict what other agents
are going to do in the future. Such knowledge can help an agent determine which of its …