Deep deterministic policy gradient algorithm for crowd-evacuation path planning

X Li, H Liu, J Li, Y Li - Computers & Industrial Engineering, 2021 - Elsevier
X Li, H Liu, J Li, Y Li
Computers & Industrial Engineering, 2021Elsevier
In existing evacuation methods, the large number of pedestrians and the complex
environment will affect the efficiency of evacuation. Therefore, we propose a hierarchical
evacuation method based on multi-agent deep reinforcement learning (MADRL) to solve the
above problem. First, we use a two-level evacuation mechanism to guide evacuations, the
crowd is divided into leaders and followers. Second, in the upper level, leaders perform path
planning to guide the evacuation. To obtain the best evacuation path, we propose the …
Abstract
In existing evacuation methods, the large number of pedestrians and the complex environment will affect the efficiency of evacuation. Therefore, we propose a hierarchical evacuation method based on multi-agent deep reinforcement learning (MADRL) to solve the above problem. First, we use a two-level evacuation mechanism to guide evacuations, the crowd is divided into leaders and followers. Second, in the upper level, leaders perform path planning to guide the evacuation. To obtain the best evacuation path, we propose the efficient multi-agent deep deterministic policy gradient (E-MADDPG) algorithm for crowd-evacuation path planning. E-MADDPG algorithm combines learning curves to improve the fixed experience pool of MADDPG algorithm and uses high-priority experience playback strategy to improve the sampling strategy. The improvement increases the learning efficiency of the algorithm. Meanwhile we extract pedestrian motion trajectories from real motion videos to reduce the state space of algorithm. Third, in the bottom layer, followers use the relative velocity obstacle (RVO) algorithm to avoid collisions and follow leaders to evacuate. Finally, experimental results illustrate that the E-MADDPG algorithm can improve path planning efficiency, while the proposed method can improve the efficiency of crowd evacuation.
Elsevier
以上显示的是最相近的搜索结果。 查看全部搜索结果