acquisition of cooperative multi-agent tasks. We extend the MAXQ framework to the multi-
agent case. Each agent uses the same MAXQ hierarchy to decompose a task into sub-tasks.
Learning is decentralized, with each agent learning three interrelated skills: how to perform
subtasks, which order to do them in, and how to coordinate with other agents. Coordination
skills among agents are learned by using joint actions at the highest level (s) of the …