作者
Shanqi Liu, Licheng Wen, Jinhao Cui, Xuemeng Yang, Junjie Cao, Yong Liu
发表日期
2021/9/27
研讨会论文
2021 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)
页码范围
4777-4784
出版商
IEEE
简介
Multi-agent path finding in formation has many potential real-world applications like mobile warehouse robotics. However, previous multi-agent path finding (MAPF) methods hardly take formation into consideration. Further-more, they are usually centralized planners and require the whole state of the environment. Other decentralized partially observable approaches to MAPF are reinforcement learning (RL) methods. However, these RL methods encounter difficulties when learning path finding and formation problems at the same time. In this paper, we propose a novel decentralized partially observable RL algorithm that uses a hierarchical structure to decompose the multi-objective task into unrelated ones. It also calculates a theoretical weight that makes each tasks reward has equal influence on the final RL value function. Additionally, we introduce a communication method that helps agents cooperate with each …
引用总数
学术搜索中的文章
S Liu, L Wen, J Cui, X Yang, J Cao, Y Liu - 2021 IEEE/RSJ International Conference on Intelligent …, 2021