Optimizing taxi carpool policies via reinforcement learning and spatio-temporal mining

I Jindal, ZT Qin, X Chen, M Nokleby… - 2018 IEEE International …, 2018 - ieeexplore.ieee.org
2018 IEEE International Conference on Big Data (Big Data), 2018 - ieeexplore.ieee.org
In this paper, we develop a reinforcement learning (RL) based system that learns an effective carpooling policy, one that maximizes transportation efficiency so that fewer cars are required to fulfill a given amount of trip demand. First, we develop a deep neural network model, called ST-NN (Spatio-Temporal Neural Network), to predict taxi trip time from raw GPS trip data. Second, we develop a carpooling simulation environment for RL training, built on the output of ST-NN and the NYC taxi trip dataset. To maximize transportation efficiency and minimize traffic congestion, we choose the effective distance covered by the driver on a carpool trip as the reward. The more effective distance a driver achieves over a trip (i.e., the more trip demand is satisfied), the higher the efficiency and the lower the traffic congestion. We compare the performance of the RL-learned policy to a fixed baseline policy that always accepts carpool requests and obtain promising, interpretable results that demonstrate the advantage of our RL approach. We also compare the performance of ST-NN to that of state-of-the-art travel time estimation methods and observe that ST-NN significantly improves prediction performance and is more robust to outliers.
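To make the reward and the always-accept baseline concrete, here is a toy sketch. It is not the paper's simulator or its learned policy. Everything in it is an assumption for illustration: effective distance is taken to be the sum of the direct trip distances of the passengers served, `added_minutes` stands in for a travel-time prediction such as ST-NN would supply, and the hand-written rate threshold stands in for a learned policy.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class Request:
    """A hypothetical carpool request (fields are illustrative assumptions)."""
    direct_km: float      # distance of the passenger's own trip
    added_minutes: float  # driving time needed to serve it (stand-in for a predicted trip time)


Policy = Callable[[Request], bool]


def always_accept(request: Request) -> bool:
    """Fixed baseline: accept every carpool request."""
    return True


def make_rate_policy(min_km_per_minute: float) -> Policy:
    """Hand-written stand-in for a learned policy: accept only requests
    that deliver enough passenger distance per minute of driving."""
    def policy(request: Request) -> bool:
        return request.direct_km / request.added_minutes >= min_km_per_minute
    return policy


def run_episode(requests: List[Request], policy: Policy, time_budget_minutes: float) -> float:
    """Return the total effective distance (assumed: sum of direct_km served)
    that the driver collects within a fixed time budget."""
    remaining = time_budget_minutes
    reward = 0.0
    for request in requests:
        if request.added_minutes > remaining:
            continue  # not enough time left to serve this request
        if policy(request):
            remaining -= request.added_minutes
            reward += request.direct_km
    return reward


requests = [
    Request(direct_km=2.0, added_minutes=20.0),
    Request(direct_km=6.0, added_minutes=15.0),
    Request(direct_km=5.0, added_minutes=15.0),
    Request(direct_km=1.0, added_minutes=10.0),
]
baseline = run_episode(requests, always_accept, time_budget_minutes=30.0)
selective = run_episode(requests, make_rate_policy(0.3), time_budget_minutes=30.0)
print(baseline, selective)
```

In this toy setting the always-accept baseline spends most of its time budget on the first low-yield request, while the selective policy declines it and serves more passenger distance in the same time. A learned policy would have to discover such a trade-off from the reward signal rather than from a fixed threshold.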