作者
Jose Luis Vazquez Espinoza, Alexander Liniger, Wilko Schwarting, Daniela Rus, Luc Van Gool
发表日期
2022/5/11
研讨会论文
Learning for Dynamics and Control Conference
页码范围
1006-1019
出版商
PMLR
简介
In most classical Autonomous Vehicle (AV) stacks, the prediction and planning layers are separated, limiting the planner to react to predictions that are not informed by the planned trajectory of the AV. This work presents a module that tightly couples these layers via a game-theoretic Model Predictive Controller (MPC) that uses a novel interactive multi-agent neural network policy as part of its predictive model. In our setting, the MPC planner considers all the surrounding agents by informing the multi-agent policy with the planned state sequence. Fundamental to the success of our method is the design of a novel multi-agent policy network that can steer a vehicle given the state of the surrounding agents and the map information. The policy network is trained implicitly with ground-truth observation data using backpropagation through time and a differentiable dynamics model to roll out the trajectory forward in time. Finally, we show that our multi-agent policy network learns to drive while interacting with the environment, and, when combined with the game-theoretic MPC planner, can successfully generate interactive behaviors.
引用总数
学术搜索中的文章
JLV Espinoza, A Liniger, W Schwarting, D Rus… - Learning for Dynamics and Control Conference, 2022